-
ETH Zurich Researchers Introduce UltraFastBERT: A BERT Variant that Uses 0.3% of its Neurons during Inference while Performing on Par with Similar BERT Models
UltraFastBERT, developed by researchers at ETH Zurich, is a modified version of BERT that achieves efficient language modeling with only 0.3% of its neurons during inference. The model utilizes fast feedforward networks (FFFs) and achieves significant speedups, with CPU and PyTorch implementations yielding 78x and 40x speedups respectively. The study suggests further acceleration through hybrid…
-
Introducing three new NVIDIA GPU-based Amazon EC2 instances
Amazon announces the expansion of its EC2 accelerated computing portfolio with three new instances powered by NVIDIA GPUs: P5e instances with H200 GPUs, G6 instances with L4 GPUs, and G6e instances with L40S GPUs. These instances provide powerful infrastructure for AI/ML, graphics, and HPC workloads, along with managed services like Amazon Bedrock, SageMaker, and Elastic…
-
New method uses crowdsourced feedback to help train robots
A novel technique allows an AI agent to use data crowdsourced from nonexpert human users to learn and complete tasks through reinforcement learning. This approach trains the robot more efficiently and effectively compared to other methods.
-
AI-generated sexually explicit material is spreading in schools
Children in the UK are using AI image generators to create indecent images of other children, according to the UK Safer Internet Centre (UKSIC). The charity has highlighted the need for immediate action to prevent the problem from spreading. The creation, possession, and distribution of such images is illegal in the UK, regardless of whether…
-
“Authentic” the Merriam-Webster word of the year, but why?
Merriam-Webster has chosen “authentic” as its Word of the Year for 2023 due to its increased relevance in the face of fake content and deep fakes. The word has multiple meanings, including being genuine and conforming to fact. This decision reflects the current crisis of authenticity in a world where trust is challenged by the…
-
Boost inference performance for LLMs with new Amazon SageMaker containers
Amazon SageMaker has released a new version (0.25.0) of Large Model Inference (LMI) Deep Learning Containers (DLCs) with support for NVIDIA’s TensorRT-LLM Library. This upgrade provides improved performance and efficiency for large language models (LLMs) on SageMaker. The new LMI DLCs offer features such as continuous batching support, efficient inference collective operations, and quantization techniques.…
-
Unveiling the Frontiers of Scientific Discovery with GPT-4: A Comprehensive Evaluation Across Multiple Disciplines for Large Language Models
Language models like GPT-4, which are part of the field of Artificial Intelligence, have gained popularity due to their remarkable capabilities in various fields. These models excel in tasks such as coding, mathematics, law, and understanding human intentions. GPT-4 can process text, images, and even display characteristics of Artificial General Intelligence (AGI). Recent research has…
-
UK and US develop new global guidelines for AI security
UK and US cyber security agencies have developed guidelines to enhance the security of AI systems. The guidelines focus on secure design, development, deployment, and operation, aiming to prevent cybercriminals from hijacking AI and accessing sensitive data. While the guidelines are non-binding, they have the endorsement of 16 countries. However, the prevalence of zero-day vulnerabilities…
-
Microsoft Releases Orca 2: Pioneering Advanced Reasoning in Smaller Language Models with Tailored Training Strategies
Microsoft introduces Orca 2, an advanced reasoning model for smaller language models. Unlike traditional imitation learning, Orca instructs models in different reasoning techniques to improve their reasoning and comprehension skills. Orca 2 outperforms other models in various language tasks and achieves high accuracy. The departure from imitation learning showcases a new approach to unlocking the…
-
Courage to learn ML: Demystifying L1 & L2 Regularization (part 1)
L1 and L2 regularization are techniques used in machine learning to prevent overfitting. Overfitting occurs when a model is too complex and learns from both the underlying patterns and the noise in the training data, resulting in poor performance on unseen data. L1 and L2 regularization add penalty terms to the model’s loss function, discouraging…