Large language model
Facebook AI Research (FAIR) introduces Habitat 3.0, a virtual training ground for building AI agents that understand their environment and collaborate with humans. Habitat 3.0 allows robots and virtual humans to complete tasks in a digital environment, providing a safer and faster alternative to real-world training. FAIR also released the Habitat Synthetic Scenes Dataset (HSSD-200)…
China’s Zhipu AI, a startup founded by a professor from Tsinghua University, has raised 2.5 billion yuan ($340 million) in funding. The company has released a bilingual AI model, ChatGLM-6B, that understands Chinese and English, as well as a larger open-source model, the GLM-130B. Zhipu AI aims to compete with global AI giants and has…
Google recently introduced a search feature called Search Generative Experience (SGE), which uses generative AI to provide summarized answers to search queries. While Google aims to improve user experience, media publishers are concerned about the lack of credit and compensation for their content. SGE’s summaries sometimes use information from publishers’ websites without proper accreditation. Publishers…
This week’s AI news highlights various topics. Google and Cambridge’s Centre for Human-Inspired AI collaborate to make AI safer. China and the UK hold AI Summit despite recent tensions. Baidu claims Ernie Bot matches GPT-4. AI can extract personal data from chat interactions, while AI companies struggle with transparency and alignment guardrails. AI aids in…
ChatGPT, the popular AI tool, has gained significant popularity. While the free version, ChatGPT 3.5, has limitations, there are ways to access the ChatGPT Plus (GPT-4) version for free. Options include using Bing AI Chat, Hugging Face Spaces, Poe AI, Phind, and Ora AI. Each platform has its own usage restrictions and features.
Large language models, such as GPT, have shown exceptional performance in text-related tasks. However, efforts are being made to teach them how to comprehend and use other forms of information, such as sounds and images. Microsoft researchers have developed DeepSpeed-VisualChat, an advanced framework that enhances multi-modal capabilities and scalability in dialogue systems. The framework uses…
Recent advancements in human motion capture have made it possible to capture motion from RGB photos and films using affordable devices. This opens up opportunities for motion capture in various industries, including sports. However, there are challenges in using computer vision-based motion capture for swimming due to the unique nature of aquatic data. Researchers have…
Companies are increasingly using user-generated images and videos for engagement, but managing inappropriate content can be a challenge. Amazon Rekognition offers pre-trained and customizable AI capabilities for content moderation. With the new Custom Moderation feature, companies can enhance the accuracy of the moderation model and tailor it to their specific needs. The feature allows for…
The text discusses the HyperHuman framework for generating hyper-realistic human images. It utilizes a large dataset and a Latent Structural Diffusion Model to improve image quality and coherence. The framework demonstrates superior performance and robustness compared to previous models. Future research can explore text-to-pose generation using deep priors.
A study published in Intelligent Computing introduces a new method called edge-sensitive single-pixel imaging (ESI) for detecting object edges even when obtaining clear images through standard optical methods is challenging due to factors like severe light pollution. The ESI method extracts edge information by illuminating an object with carefully crafted modulation patterns, bypassing the need…
This text is about using Python to analyze the geospatial data from the International Union for Conservation of Nature (IUCN).
GPT-4 is the latest language model developed by OpenAI, known for its accuracy and safety. It can process various formats such as images, PDFs, and CSVs. Other AI tools mentioned include Bing AI for accurate answers, DALL-E 2 for text-to-image generation, Adobe Firefly for image editing, and many more.
Music publishers, including Universal Music, ABKCO, and Concord Publishing, have filed a lawsuit against Anthropic in Tennessee federal court. The lawsuit accuses Anthropic of misusing copyrighted song lyrics to train its chatbot Claude, infringing upon the publishers’ rights. Examples of songs mentioned in the lawsuit range from the Beach Boys’ “God Only Knows” to Mark…
The NYPD has partnered with tech company Truleo to use AI to analyze police body-worn camera footage. Truleo’s software categorizes officers’ language and scores interactions as “professional” or “unprofessional.” Meanwhile, in the UK, there are plans to roll out facial recognition technology in shops, despite concerns about privacy and bias.
The text discusses the challenges of building anomaly detection models using high-resolution imagery and proposes a two-stage approach to overcome these challenges. It describes the training process for a Rekognition Custom Labels model and presents the results of experiments conducted using one-stage and two-stage models to detect missing holes in PCBs. The two-stage model outperformed…
Anthropic, the company behind the AI chatbot Claude, conducted an experiment involving around 1,000 Americans to explore the idea of letting ordinary people shape the rules that govern AI behavior. By allowing public input, Anthropic aims to bridge the gap between public opinion and the AI industry. The experiment resulted in a “Collective Constitutional AI”…
A safety mitigation stack was created for the wider release of DALL·E 3. Updates on provenance research will be shared.
AI models like GPT-4, used by companies such as OpenAI and Meta, can infer personal information from our online chats and comments, even when we think we’re not revealing anything personal. Researchers found that GPT-4 could accurately infer attributes like age, education, sex, occupation, and more from Reddit comments. This has implications for privacy and…
The text explains how to summarize text effectively and accurately.
A new diffusion-based continuous GNN model has been developed that improves generalization capabilities.