Large language model
The PAICS deep learning system has shown promising results in enhancing the diagnostic performance of sonologists in detecting fetal intracranial malformations. A study involving 36 sonologists found that the system substantially improved the accuracy of CNS malformation classification. Further research in real clinical settings is needed to fully assess the system’s assistance in identifying congenital…
The use of personally identifiable information (PII) is widespread and includes various types of data that can identify individuals. Detecting and redacting PII is essential for privacy protection and compliance. Failure to do so can lead to significant consequences. The legal system utilizes electronic discovery (eDiscovery) to identify and produce electronically stored information (ESI) in…
You.com has released the YouRetriever, an easy-to-use interface for the You.com Search API. They tested the API with different datasets to improve efficiency in Retrieval Augmented Generation (RAG)-QA applications. They compared the You.com Search API with the Google Search API and provided a framework for evaluating LLMs in an RAG-QA environment. They conducted tests using…
Generative AI is playing a growing role in business operations and customer service. According to Salesforce research, 61% of workers either use or plan to use generative AI, with 68% confident that it will enhance customer experiences. However, human oversight is still seen as crucial. Generative AI can assist with tasks such as creating targeted…
Researchers from Peking University, Meituan, Meta AI, National Key Laboratory of General Artificial Intelligence, BIGAI, and Renmin University of China have introduced a compression paradigm called Retrieval-based Knowledge Transfer (RetriKT). This approach aims to efficiently transfer information from Large Language Models (LLMs) to small-scale models. Their method involves extracting knowledge from LLMs to create a…
Releasing the weights of a large language model (LLM) allows for fine-tuning and bypassing guardrails. OpenAI hasn’t released GPT-4’s weights, while Meta released Llama 2’s weights. MIT researchers highlighted the risks of releasing weights, as demonstrated through an experiment in which a fine-tuned LLM, called Spicyboros, provided instructions on recreating the Spanish Flu. Removing guardrails…
Researchers at Sungkyunkwan University have developed a novel memory system called “Memoria” that enhances the performance of transformer models in handling lengthy data sequences. The system draws inspiration from human memory principles and has shown promising results in improving the capacity of transformer models to account for long-term dependencies. This development has the potential to…
Researchers from Peking University have introduced KnowGen, a method for generating new knowledge by modifying existing entity attributes and relationships. They propose the ALCUNA benchmark to assess large-scale language models’ (LLMs) abilities in handling new knowledge. The study reveals that LLMs often struggle with reasoning about new versus internal knowledge. The researchers emphasize caution when…
Sir Nick Clegg, President of Global Affairs at Meta, emphasized that the UK AI Safety Summit should prioritize the risks posed by generative AI in upcoming elections over speculative AI risks. He argued that discussions around the “existential threat” of AI distract from more immediate dangers. Clegg also cautioned against stifling AI development with excessive…
NVIDIA has released a groundbreaking research paper demonstrating how generative artificial intelligence (AI) can revolutionize semiconductor design. The study reveals that large language models (LLMs) can benefit specialized fields like chip design. NVIDIA’s custom LLM called ChipNeMo, developed using the NVIDIA NeMo framework, has already shown promising results in tasks like software generation and bug…
Federal judge William Orrick dismissed the majority of the copyright infringement claims brought by three artists against Stability AI, Midjourney, and DeviantArt. The claims were based on the use of the artists’ work to train AI models. Two artists dropped their claims due to lack of copyright registration. The judge ruled that the claims were…
MetaCLIP is a new approach for data curation that outperforms OpenAI’s CLIP on multiple benchmarks. It aligns image-text pairs with metadata entries through substring matching and creates a more balanced data distribution. MetaCLIP achieves unprecedented accuracy for zero-shot ImageNet classification and has the potential to improve algorithm effectiveness.
This article discusses the potential of using hexagon maps for data analysis. Hexagon maps provide a balanced geometry for better regional comparisons and improved territorial coverage. The article provides a step-by-step explanation of how to create hexagonal maps in Python, utilizing the H3 and Plotly libraries. The example used in the article is visualizing the…
The research introduces DALL-E 3, an AI text-to-image generation model that aims to improve spatial awareness, text rendering, and specificity in generated images. The OpenAI team proposes a training approach that combines synthetic and ground-truth captions to enhance the model’s image generation capabilities. The study highlights the role of advanced language models in refining textual…
A BBC report by two young reporters explores the role of AI in education. Students shared their experiences, with some using ChatGPT to simplify assignments while others admitted to using it to cheat. The report highlighted the need for a balanced approach to AI usage and the importance of teaching students how to use it…
Microsoft is facing criticism from The Guardian for an AI-generated poll that accompanied a news story about a woman’s death. The poll prompted users to speculate on the cause of her death, with options including murder, suicide, and accident. The incident has raised concerns about Microsoft’s AI-driven content production, following previous controversies and errors. The…
Apple researchers have introduced a novel deep learning-based technique for online 3D reconstruction using dynamically-posed RGB images. They have developed a dataset called LivePose and proposed a recurrent de-integration module to handle pose changes in reconstruction. The technique offers qualitative and quantitative improvements in reconstruction measures. Their work aims to mimic real-world environments for mobile…
New studies suggest that the brain employs a self-supervised learning process that resembles machine learning. This process enables the brain to learn about visual scenes by identifying their similarities and differences, without relying on labels or additional information.
Del Complex plans to deploy its BlueSea Frontier Compute Clusters (BSFCC) in international waters to enable AI developers to bypass AI regulations. Each BSFCC will offer computing power equivalent to over 10,000 Nvidia H100 GPUs. The company claims that the platforms, which gain sovereign nation-state status, will provide unparalleled opportunities for large-scale AI model development…
The Mixture of Experts (MoE) architecture combines multiple subnetworks to handle complex data, but it can be computationally expensive. Researchers have introduced QMoE, a framework that compresses trillion-parameter MoEs to less than 1 bit per parameter, making them more efficient to run. This is achieved through data-dependent quantization methods and can be processed in less…