Large language model
Introducing GraphCast, an advanced AI model capable of providing highly accurate medium-range weather forecasts, setting a new standard in forecasting accuracy.
The article explains the TSMixer architecture and shows how to implement it in Python for long-term multivariate forecasting tasks.
Chinese researchers have developed an AI robot chemist that can potentially extract oxygen from Martian resources. By using Martian materials to create catalysts that release oxygen from water, this technology represents a significant advancement in space exploration and resource utilization. The robot chemist achieved this process much faster than a human researcher, and its ability…
OpenAI has introduced a new feature called “Create a GPT” in ChatGPT, allowing users to create custom versions of ChatGPT for specific tasks or interests. Users can customize ChatGPT with their own data without the need for coding expertise. OpenAI plans to launch a GPT Store where users can publish their custom GPTs and potentially…
ChatGPT and OpenAI’s API experienced periodic outages on 8 November due to a distributed denial-of-service (DDoS) attack. Hacktivist group Anonymous Sudan claimed responsibility, citing OpenAI’s cooperation with Israel and bias in ChatGPT. Other AI chatbots, Google’s Bard and Anthropic’s Claude, also faced capacity constraints. The incident highlights the vulnerability of society as we become increasingly dependent on…
The article discusses Retrieval-Augmented Generation (RAG), a technique that supplies large language models (LLMs) with additional information from an external knowledge source. It explains the factual inaccuracies that can occur when prompting LLMs and presents RAG as a solution. It also provides an implementation example using LangChain for orchestration, OpenAI…
Facebook’s introduction of the News Feed in 2006 revolutionized the platform, providing users with a constantly updating stream of posts and status changes. Despite user complaints, engagement doubled. The company then implemented an algorithm called EdgeRank to prioritize content based on factors like age, engagement, and user connections. As Facebook embraced machine learning, it faced…
Fei-Fei Li, co-director of Stanford’s Human-Centered AI Institute, believes we are in an inflection moment for AI. Generative AI has caused the public to wake up to AI technology, leading to more businesses implementing AI in real-world products. Li discusses the risks of AI, the flaws of ImageNet, the role of data, and offers tips…
The text introduces the asymmetric certified robustness problem for deep learning classifiers, which addresses the vulnerability of these classifiers to adversarial examples. It presents feature-convex classifiers as a solution to this problem, providing closed-form, deterministic certified radii for inputs. The text also highlights the theoretical promise of input-convex classifiers achieving perfect…
Researchers from Emory University and Georgia Institute of Technology have developed CLINGEN, a generic framework for generating high-quality clinical texts in few-shot situations. By combining clinical knowledge extraction from knowledge graphs and large language models, CLINGEN improves the variety and distribution of synthetic clinical data. Experimental results show consistent performance increases across multiple tasks.
Researchers from ETH Zurich have proposed modifications to simplify transformer blocks in deep neural networks without compromising training speed or performance. By combining signal propagation theory and empirical observations, they explored the removal of various components from standard transformer blocks. The proposed simplified transformers achieved comparable performance to standard transformers while using fewer parameters and…
GitLab has launched Duo Chat, a new tool integrated into its developer platform that aims to simplify the developer experience by leveraging conversational AI. The tool allows developers to have natural language conversations with the AI, providing code explanations, generating tests, and simplifying coding tasks. GitLab emphasizes a privacy-first approach and aims to make AI…
Researchers from Microsoft Mixed Reality & AI Lab have introduced an approach called HMD-NeMo (HMD Neural Motion Model) that generates accurate full-body motion in immersive mixed-reality scenarios, even when hands are only partially visible. HMD-NeMo uses a spatiotemporal encoder with novel mask tokens to encourage plausible motion, and it operates online and in real time.…
The SEINE model is a short-to-long video diffusion model that generates high-quality extended videos with smooth and creative transitions between scenes. It focuses on generating intermediate frames between two different scenes to achieve seamless transitions. The model incorporates a random mask module and takes into account both visual and textual input to enhance the controllability…
A team of researchers at the University of Science and Technology of China has developed an AI robot that uses Martian meteorite extracts to produce oxygen. The robot created a catalyst from the Martian rock samples to accelerate the process of extracting oxygen from water. This breakthrough demonstrates the potential of AI in supporting space…
Nvidia has announced the H200, a high-end chip designed for training AI models, with enhanced performance in inference. The chip is expected to be shipped in the second quarter of 2024 and will be compatible with existing systems using the H100. Nvidia’s stock has seen a 230% increase in 2023 due to the excitement around…
Cerebras Systems and G42 have achieved a significant milestone in the field of artificial intelligence with the completion of a 4-exaflop AI supercomputer. This partnership showcases their technical expertise and commitment to innovation. They are now aiming to reach 8 exaflops, which will have far-reaching implications for AI research and various industries. This achievement highlights the…
Researchers from Waabi and the University of Toronto have developed LabelFormer, a transformer-based AI model that efficiently refines object trajectories for auto-labelling. This technique improves the accuracy of bounding boxes by utilizing the entire time context and outperforms window-based approaches in terms of computing efficiency. The comprehensive experimental assessment demonstrates the effectiveness of LabelFormer in…
Rosalyn has introduced StableSight, an advanced AI system to tackle academic dishonesty in online education. It features gaze-tracking and keyboard sound analysis to detect cheating methods like secondary screens and concealed devices. The platform identifies suspected cases of cheating and involves human reviewers for fair assessment. StableSight has gained recognition from organizations and aims to…
Amazon Personalize is a machine learning technology that enables businesses to provide personalized recommendations to their customers. It simplifies the integration of personalized recommendations into websites, applications, and email marketing systems. With Amazon Personalize, businesses can easily train models, process data, and generate real-time recommendations. The solution can be implemented using various AWS services such…