Artificial Intelligence
A physical neural network has achieved a milestone in machine intelligence by learning and retaining information in a manner similar to human brain neurons. This breakthrough paves the way for the development of efficient and low-energy machine intelligence for complex real-world learning and memory tasks.
Large language models (LLMs) have gained popularity in the AI community as they are seen as a step towards artificial general intelligence (AGI). However, LLMs have limitations, such as dependence on unstructured text and difficulty integrating new knowledge. Researchers are exploring the use of graph-structured data to address these issues. Google Research has conducted investigations…
MATHVISTA is a benchmark to assess the mathematical reasoning abilities of Large Language Models and Large Multimodal Models within visual contexts. It combines various mathematical and graphical tasks and includes existing and new datasets. The benchmark reveals a performance gap compared to humans and emphasizes the need for further advancement in AI agents with mathematical…
This article reviews the top open-source language models currently available.
YouTube Music has launched a new feature that allows users to create personalized playlist cover art using generative AI technology. Users can select a theme and specific request, and YouTube’s AI system generates a selection of images to choose from. This feature is currently available to English-language users in the United States but will expand…
Progressive Conditional Diffusion Models (PCDMs) have been introduced by Tencent AI Lab to address the challenges in pose-guided person image synthesis. PCDMs consist of three stages: predicting global features, establishing dense correspondences, and refining images. The method effectively aligns source and target images at multiple levels, producing high-quality and realistic results. It also demonstrates improved…
This article discusses three key questions for junior data scientists to consider when thinking about their future careers. The first question is whether they want to be an individual contributor, a manager, or a combination of both. The second question is whether they want to specialize in areas like machine learning, decision science, or analytics…
Researchers from JPMorgan Chase & Co. tested OpenAI’s ChatGPT and GPT-4 models to determine whether they could pass the CFA exam. They found that ChatGPT would likely not pass CFA Levels I and II, while GPT-4 had a decent chance with appropriate prompting. Both models faced challenges with Level II.…
On day two of the AI Safety Summit, UK Prime Minister Rishi Sunak announced that industry leaders such as Meta, Google DeepMind, and OpenAI have agreed to allow government evaluation of their AI tools before market launch. The summit also established the AI Safety Institute and unveiled a forthcoming “state of AI science” report…
Large language models (LLMs) are being used more frequently as conversational systems, leading to increased reliance on them for answers. To understand how these models respond to questions about ongoing debates, we need datasets with human-annotated labels reflecting contemporary discussions. To address this, we propose a new way of creating a dataset for controversial questions.
Stanford University researchers have introduced EquivAct, a visuomotor policy learning approach that enables robots to generalize tasks across different scales and orientations. The proposed method incorporates equivariance into the visual object representation and policy architecture to ensure robustness across variations in object placements, orientations, and sizes. By using SIM(3)-equivariant network architectures, the learnt policy can…
Researchers from the Universities of Oxford, Münster, Heidelberg, and Exeter have developed innovative photonic-electronic hardware capable of handling three-dimensional (3D) data. This breakthrough significantly enhances the parallelism of data processing for artificial intelligence (AI) tasks. By using radio-frequency modulation, wavelength multiplexing, and non-volatile memories, the team achieved a high level of parallelism, surpassing previous accomplishments.…
Amazon SageMaker Canvas now supports deploying ML models to real-time inferencing endpoints, eliminating the need for manual export, configuration, testing, and deployment. This feature enables users to easily consume model predictions and drive actions outside of the SageMaker Canvas workspace. The process of deploying a model in SageMaker Canvas to a real-time endpoint is explained…
Teachers and students can use a generative AI solution to create course materials and learn English words and sentences. The solution provides real-time assessments and personalized feedback for students. Teachers can generate questions and answers, create images, save assignments to a database, and browse existing assignments. Students can select assignments, answer questions, check grading scores,…
CQuotient, a software startup founded by Rama Ramakrishnan, offers personalized recommendations for retailers by tracking customer interactions. The software has been adopted by Salesforce. Ramakrishnan, now a professor at MIT Sloan, teaches students how to apply AI technologies pragmatically. He also guides senior executives in using pre-trained AI models and understanding different categories…
The potential for AI systems to revolutionize manufacturing is discussed by Ritu Jyoti, global AI research lead at IDC. Windmill manufacturers have employed AI to improve their processes, using digital twins and machine learning to simulate and implement enhancements. The manufacturing sector is urged to embrace AI to enhance efficiency, safety, and sustainability. However, a…
Stability AI has unveiled new additions to its text-to-image products, including Sky Replacer, Stable 3D, and Stable FineTuning. Sky Replacer allows users to replace the sky in a photograph with preset templates, while Stable 3D generates concept-quality textured 3D objects based on text or images. Stable FineTuning enables customization of pictures, objects, and styles. The…
GlotLID-M is a language identification (LID) model that supports 1665 languages, including low-resource languages. It addresses challenges such as inaccurate corpus metadata, leakage from high-resource languages, difficulty distinguishing closely related languages, the handling of macrolanguages versus their varieties, and noisy data. GlotLID-M outperformed several baseline models in accuracy and can be easily incorporated into datasets.
This week’s Variable highlights recent articles from the Tips & Tricks column of Towards Data Science. The articles offer actionable advice for data scientists to save time and produce better results in their projects. Topics include streamlining repetitive tasks in data analysis, using Pydantic V2 for data validation, common index-related operations in Pandas, color usage…
A new cloud observability platform called Middleware.io has been introduced to address the need for efficient troubleshooting and issue management in cloud-native applications. Leveraging AI-driven anomaly detection and resolution, the platform offers real-time monitoring, predictive capabilities, and a user-friendly dashboard. With its recent seed funding, Middleware.io is poised for expansion and further AI development, promising…