- Alibaba Researchers Propose Reward Learning on Policy (RLP): An Unsupervised AI Framework that Refines a Reward Model Using Policy Samples to Keep it on-Distribution
- SineNet by Texas A&M University and the University of Pittsburgh Innovates PDE Solutions: Addressing Temporal Misalignment in Fluid Dynamics Through Deep Learning
- Researchers at Stanford and Databricks Open-Sourced BioMedLM: A 2.7 Billion Parameter GPT-Style AI Model Trained on PubMed Text
- Teaching SOLAR to Shine: How Upstage AI’s sDPO Aligns Language Models with Human Values
- Modular Open-Sources Mojo: The Programming Language that Turns Python into a Beast
- Meet Deep-Seek: An Open Source Research Agent Designed as an Internet Scale Retrieval Engine
- OA-CNNs: A Family of Networks that Integrates a Lightweight Module to Greatly Enhance the Adaptivity of Sparse Convolutional Neural Networks CNNs at Minimal Computational Cost
- Layerwise Importance Sampled AdamW (LISA): A Machine Learning Optimization Algorithm that Randomly Freezes Layers of LLM Based on a Given Probability
- Mistral AI Releases Mistral 7B v0.2: A Groundbreaking Open-Source Language Model
- ChatGPT vs Perplexity AI: AI App Comparison