Large language model
In November 2022, OpenAI’s ChatGPT saw rapid growth, reaching a million users in 5 days, then soaring to 100 million by January 2023. In April 2023, the user count hit 173 million, with over 1.5 billion monthly website visits by January 2024. The U.S. and India have the highest user bases. Additionally, the platform is…
Elon Musk announced the first successful human trial of Neuralink’s brain implant, “Telepathy,” allowing control of devices simply through thought. Targeting individuals with limited hand mobility, the implant aims to restore autonomy and unlock human potential. The fusion of AI and brain-machine interfaces could revolutionize communication speed and capability, paving the way for an inevitable…
IBM Research introduces Unitxt, a collaborative platform for processing unified textual data, offering a Python module with configurable pipelines for handling textual data in multiple languages. This facilitates collaboration, transparency, and reproducibility. Unitxt allows for over 100,000 recipe configurations, facilitates integration of datasets, and serves as a crucial data backbone for large language models.
Researchers from the College of Computer Science, Sichuan University, and the Engineering Research Center of Machine Learning and Industry Intelligence, Ministry of Education Chengdu, China, have introduced DREditor, a time-efficient method for adapting dense retrieval models to specific domains. DREditor achieves 100-300 times faster time efficiency and extends applicability to domain-specific scenarios. [50 words]
Current multi-modal language models face limitations in performing complex visual reasoning tasks, requiring a blend of low-level object motion analysis with high-level spatiotemporal reasoning. Research in this area is advancing with models like Pix2seq, VideoChatGPT, and the LRR model by Qualcomm AI Research, which shows superior performance in video reasoning tasks. The LRR model’s “Look,…
Researchers from Peking University, Pika, and Stanford University have introduced RPG, a novel state-of-the-art framework for text-to-image conversion. RPG utilizes multimodal Large Language Models (MLLMs) to enhance compositionality, precision, and flexibility. It demonstrates superior performance over existing models, particularly in handling complex text prompts involving multiple objects and relationships. Learn more in the research paper…
Artificial Intelligence, particularly deep learning, has transformed various fields, including medical imaging. Stanford University and Stability AI have introduced CheXagent, an instruction-tuned FM for CXR interpretation with a comprehensive evaluation framework, CheXbench. CheXagent demonstrated superior performance in various CXR interpretation tasks, showing potential to enhance clinical decision-making in medical imaging.
Microsoft is poised for its best quarterly growth in nearly two years, with a projected 15.8% revenue rise. Its alliance with OpenAI has propelled it to a $3 trillion valuation, establishing dominance in AI. Analysts project strong growth for Azure due to increased demand for AI services, despite competition from AWS and Google Cloud.
The Biden administration is compelling cloud service providers to disclose foreign users developing AI technologies, particularly in China. This aims to restrict access to essential data centers and servers and curb perceived malicious cyber-enabled activities. US-China tensions in AI escalate, with the US enforcing strategies to maintain a technological edge and national security.
Researchers have created a robotic sensor with AI that can read braille at double the speed of human readers.
The text is an urgent message to Taylor, encouraging her to take action against nonconsensual deepfake porn. It describes the disturbing rise of deepfake technology, its impact on women and marginalized groups, and the lack of effective solutions. The author urges Taylor to use her influence to push for regulations and real change in the…
The development of Large Language Models (LLMs) in the field of Artificial Intelligence (AI) has shown significant progress, particularly in understanding and generating natural language. Challenges in managing non-English languages led to the creation of MaLA-500, a new LLM covering 534 languages, addressing data scarcity and linguistic variation. The model’s adaptability proves its significance in…
MambaByte, a byte-level language model developed by Cornell University researchers, revolutionizes language models by efficiently managing lengthy byte sequences without traditional tokenization. It significantly outperforms MegaByte, showcasing superior efficiency and results with fewer computational resources. This breakthrough hints at an exciting future for token-free language modeling in natural language processing.
Physicists have developed a new type of neural network using active colloidal particles instead of electricity. This physical system shows promise for artificial intelligence and time series prediction, offering an alternative to traditional microelectronic chip-based digital calculations.
Millions witnessed nonconsensual deepfake pornography of Taylor Swift on social media platform X, prompting the platform to block searches for her. Generating deepfakes with AI has made it easier to sexually harass people. The fight against nonconsensual deepfakes includes using watermarks, protective shields for images, and stricter regulations to hold perpetrators accountable and protect victims.
In September 2022, former Google AI experts Noam Shazeer and Daniel De Freitas released Character.AI, an advanced chatbot. By May 2023, the app had over 1.7 million downloads and high user engagement. As of 2024, it boasts 20 million global users, with 60% aged 18-24. The app’s success raises concerns about mental health support and…
The late comedian George Carlin’s estate is suing the creators of an AI-generated video impersonating Carlin, claiming copyright infringement and violation of Carlin’s right to publicity. It was initially believed that the show was created by an AI, but the creators have stated that it was actually written by a human. The lawsuit raises questions…
Researchers from The Wharton School explored methods to enhance GPT-4’s creativity in idea generation. Experimenting with various prompting strategies, they found that longer prompts and Chain of Thought (CoT) instructions resulted in more diverse ideas. While GPT-4’s ideas were initially similar, strategic prompting improved diversity, making it a valuable tool in brainstorming.
Research on fitness landscapes in evolutionary biology explores the challenge of mapping and understanding the relationship between genotypes and an organism’s fitness. Conventional methods for assessing this complex relationship are limited, prompting the use of deep learning models to predict and analyze fitness. These innovative approaches offer a more scalable and efficient means of studying…
The web is a vast source of knowledge constantly changing, posing challenges for accurate information retrieval. Language models like chatGPT add complexity, leading to research on Retrieval Augmented Language Models (RALMs). San Jose State University proposed TempRALM, using temporal retrieval to enhance Atlas, outperforming it by 74% with fewer resources. Future applications include fact-checking and…