-
Schedule Amazon SageMaker notebook jobs and manage multi-step notebook workflows using APIs
Amazon SageMaker Studio offers a managed environment for developing, training, and deploying ML models, with the ability to run notebooks as scheduled jobs. SageMaker Pipelines now includes notebook jobs as a step, enabling data scientists to create complex, multi-step ML workflows. With the Python SDK, these workflows can be programmed and managed via SageMaker Studio,…
-
Announcing new tools and capabilities to enable responsible AI innovation
AWS is focused on responsibly developing generative AI, prioritizing safety, fairness, and security through innovations like Amazon CodeWhisperer with security scanning, Amazon Titan for content management, and privacy with Amazon Bedrock. Collaborations, customer engagement, and new tools like Guardrails and Model Evaluation on Amazon Bedrock enable safe scaling of AI, embedding safeguards against disinformation and…
-
Introduction to Data Manipulation in R with {dplyr}
The {dplyr} package in R is designed for data manipulation, offering functions to filter, sort, and summarize data. One can group data, count distinct values, and strategically create or modify variables with “if else” or “case when” conditions. The package’s ease of use and code readability are highlighted, and chaining operations is efficient with the…
-
Cognitive Biases in Data Science: The Category-Size Bias
A data scientist’s guide to combating category size bias: size doesn’t necessarily correlate with quality or performance. Small models can be effective, accuracy can mask class imbalance, larger datasets don’t always improve predictions, and longer algorithms aren’t inherently better. Awareness and questioning assumptions can mitigate bias.
-
Stability AI explores a potential acquisition amid investor pressures
Stability AI, the company behind Stable Diffusion, is considering a sale amidst investor unrest and financial woes. CEO Emad Mostaque’s leadership has been questioned by investors, including Coatue Management, leading to tensions. Despite releasing impressive tech and achieving unicorn status in 2022, the firm’s high expenses over revenue raise sustainability concerns.
-
DeepMind’s GNoME system discovered millions of new materials
DeepMind’s AI GNoME predicts over 2 million new materials, revolutionizing discovery with deep-learning models and autonomous laboratory A-Lab, enhancing synthesis efficiency and potential applications in various high-tech fields, outlined in a Nature-published study.
-
Introducing the AWS Generative AI Innovation Center’s Custom Model Program for Anthropic Claude
The AWS Generative AI Innovation Center, launched in June 2023, has assisted numerous clients in creating custom AI solutions. Starting Q1 2024, the new Custom Model Program will enable customers to fine-tune Anthropic Claude models with their own data through Amazon Bedrock. The program offers specialized support from AI experts for tailored model optimization.
-
My Fourth Week of the #30DayMapChallange
The author shares their insights from the fourth week of the #30DayMapChallenge, where participants create daily thematic maps, offering analysis on their experience. Read more at Towards Data Science.
-
Charting the Final Frontier: Completing the #30DayMapChallenge Odyssey
The #30DayMapChallenge concluded with participants creating compelling geo-visualizations, demonstrating the power of community and data storytelling. The challenge encompassed various themes like Oceania’s wildlife, global migration flows, traffic patterns, and diamond extraction visualization techniques, highlighting unique data interpretations and the significance of collective creativity throughout the event.
-
Millions of new materials discovered with deep learning
Researchers have discovered 2.2 million new crystals, using GNoME, a deep learning tool that predicts material stability, accelerating discovery time equivalent to 800 years of research.