Deploy and fine-tune foundation models in Amazon SageMaker JumpStart with two lines of code

The Amazon SageMaker JumpStart SDK has been simplified for building, training, and deploying foundation models. The code for prediction is now easier to use. This post demonstrates how to get started with using foundation models using the simplified SageMaker Jumpstart SDK in just a few lines of code. You can find more information about the SDK for deployment and training in the documentation. SageMaker JumpStart provides pre-trained models and solution templates for various problem types, as well as example notebooks. You can access these resources through the SageMaker JumpStart landing page or use the SageMaker Python SDK. The post also provides instructions on how to deploy and invoke the models, as well as customize the new classes in the SageMaker SDK. Overall, the simplified SageMaker JumpStart SDK offers an easy way to work with task-based and foundation models, and supports customization for specific requirements.

Simplified SageMaker JumpStart SDK for Building, Training, and Deploying Models

We are excited to introduce a simplified version of the Amazon SageMaker JumpStart SDK that makes it easy to build, train, and deploy foundation models. With just a few lines of code, you can get started with using pre-trained models and fine-tune them for your specific tasks.

Solution Overview

SageMaker JumpStart provides pre-trained, open-source models for various problem types, allowing you to quickly start your machine learning (ML) projects. You can incrementally train and fine-tune these models before deployment. JumpStart also offers solution templates and example notebooks for common ML use cases.

To demonstrate the capabilities of the new SageMaker JumpStart SDK, we show you how to use the pre-trained Flan T5 XL model from Hugging Face for text generation and summarization tasks. You can also use other models like Llama2, Falcon, or Mistral AI for text generation.

To deploy the model, you can use the model ID provided for each pre-trained model. In this example, we use the model ID for the Flan T5 XL model. After deployment, you can easily invoke the model to generate summaries of text.

Deploy and Invoke the Model

To deploy the Flan T5 XL model, you can use the simplified SageMaker JumpStart SDK. Simply instantiate the model object with the model ID and call the deploy method. Here’s an example:

from sagemaker.jumpstart.model import JumpStartModel

pretrained_model = JumpStartModel(model_id="huggingface-text2text-flan-t5-base")
pretrained_predictor = pretrained_model.deploy()

Once the model is deployed, you can invoke it by passing the text to the predictor. The response from the model will be returned as a Python dictionary. Here’s an example:

text = "Summarize this content - Amazon Comprehend uses natural language processing (NLP) to extract insights about the content of documents..."
query_response = pretrained_predictor.predict(text)
print(query_response["generated_text"])

This will generate a summary of the provided text using the Flan T5 XL model.

Fine-tune and Deploy the Model

The SageMaker JumpStart SDK also provides a JumpStartEstimator class for simplified fine-tuning. You can provide the location of your fine-tuning data and optionally pass validation datasets. After fine-tuning, you can deploy the model using the deploy method of the Estimator object. Here’s an example:

from sagemaker.jumpstart.estimator import JumpStartEstimator

estimator = JumpStartEstimator(
    model_id=model_id,
)
estimator.set_hyperparameters(instruction_tuned="True", epoch="3", max_input_length="1024")
estimator.fit({"training": train_data_location})
finetuned_predictor = estimator.deploy()

Customize the New Classes in the SageMaker SDK

The new SDK allows you to customize the deployment and invocation based on your requirements. You can override the defaults and customize parameters such as instance type, VPC configuration, and more. Here’s an example of overriding the instance type:

finetuned_predictor = estimator.deploy(instance_type='ml.g5.2xlarge')

You can also customize the input payload format type using serializers and content types. Here’s an example of setting the payload input format as JSON:

from sagemaker import serializers
from sagemaker import content_types

pretrained_predictor.serializer = serializers.JSONSerializer()
pretrained_predictor.content_type = 'application/json'

Conclusion

The simplified SageMaker JumpStart SDK makes it easy to build, train, and deploy models with just a few lines of code. You can use pre-trained models or fine-tune them for your specific tasks. The SDK also allows for customization to meet your requirements. Explore the available models and start leveraging AI to redefine your work processes.

About the Authors

Evan Kravitz, Rachna Chadha, Jonathan Guinegagne, and Dr. Ashish Khetan are experts in the field of AI and ML, with experience in developing and applying machine learning algorithms. They are part of the Amazon SageMaker JumpStart team, working to make AI accessible and impactful.

If you’re interested in evolving your company with AI, connect with us at hello@itinai.com. For more insights into leveraging AI, follow us on Telegram t.me/itinainews or Twitter @itinaicom.

Spotlight on a Practical AI Solution

Consider the AI Sales Bot from itinai.com/aisalesbot, designed to automate customer engagement and manage interactions across all stages of the customer journey. Discover how AI can redefine your sales processes and customer engagement. Explore solutions at itinai.com.

List of Useful Links:

AI Lab in Telegram @aiscrumbot – free consultation

Deploy and fine-tune foundation models in Amazon SageMaker JumpStart with two lines of code

AWS Machine Learning Blog

Twitter – @itinaicom

Unleash Your Creative Potential with AI Agents

Competitors are already using AI Agents

Business Problems We Solve

Automation of internal processes.
Optimizing AI costs without huge budgets.
Training staff, developing custom courses for business needs
Integrating AI into client work, automating first lines of contact

Large and Medium Businesses

Startups

Offline Business

Get a plan to reduce routine and improve metrics

100% of clients report increased productivity and reduced operati

AI Agents

Localization Project Manager – Coordinating translation workflows, answering vendor or process-related questions.

Job Title: Localization Project Manager Overview The Localization Project Manager plays a vital role in coordinating translation workflows while addressing vendor and process-related queries. This position is crucial for ensuring that translation projects are executed efficiently…
AI Agents

Environmental Health & Safety Officer – Answering compliance-related questions, retrieving safety protocols or audit histories.

Professional Summary The AI-driven Environmental Health & Safety Officer is a reliable and effective digital team member that performs repetitive and time-consuming tasks with remarkable speed, accuracy, and stability. By automating these tasks, it frees up…
AI Agents

Legal Contract Reviewer – Auto-flagging clause inconsistencies or retrieving precedent cases for review.

Job Title: Legal Contract Reviewer – Auto-flagging Clause Inconsistencies or Retrieving Precedent Cases for Review The AI functions as a reliable and effective digital team member that excels in performing repetitive and time-consuming tasks. With remarkable…
AI Agents

Customer Retention Analyst – Creating customer summaries, identifying churn risk patterns, and suggesting retention steps.

Customer Retention Analyst Professional Summary A highly analytical and detail-oriented Customer Retention Analyst with a proven track record in creating comprehensive customer summaries, identifying churn risk patterns, and suggesting effective retention strategies. Adept at leveraging data-driven…

Itinai.com httpss.mj.runmrqch2uvtvo russian handsome charisma 9fdbb2d5 a55b 425d 8f3b 76d26f86710f 2

AI Business Accelerator

Start Your AI Business in Just a Week with itinai.com

You’re a great fit if you:

Have an audience (even 500+ followers in Instagram, email, etc.)
Have an idea, service, or product you want to scale
Can invest 2–3 hours a day
You’re motivated to earn with AI but don’t want to handle technical setup

AI news and solutions

Meet Hertz-Dev: An Open-Source 8.5B Audio Model for Real-Time Conversational AI with 80ms Theoretical and 120ms Real-World Latency on a Single RTX 4090

Unlocking Real-Time Conversational AI with Hertz-Dev The Challenge Conversational AI is essential in technology today, but achieving quick and efficient interactions can be tough. Latency, or the delay between a user’s input and the AI’s response,…

AI Tech News
WebDreamer: Enhancing Web Navigation Through LLM-Powered Model-Based Planning

Strategic Planning in AI Artificial intelligence has made great strides, especially in mastering complex games like Go. Large Language Models (LLMs) combined with advanced planning techniques have shown significant progress in handling complex reasoning tasks. However,…

AI Tech News
APEER: A Novel Automatic Prompt Engineering Algorithm for Passage Relevance Ranking

Solving Information Retrieval Challenges with APEER Automating Prompt Engineering for Enhanced LLM Performance A significant challenge in Information Retrieval (IR) using Large Language Models (LLMs) is the heavy reliance on human-crafted prompts for zero-shot relevance ranking.…

AI Tech News
Microsoft’s TAG-LLM: An AI Weapon for Decoding Complex Protein Structures and Chemical Compounds!

The integration of Large Language Models (LLMs) in scientific research signals a major advancement. Microsoft’s TAG-LLM framework addresses LLMs’ limitations in understanding specialized domains, utilizing meta-linguistic input tags to enhance their accuracy. TAG-LLM’s exceptional performance in…

AI Tech News
EASYTOOL: An Artificial Intelligence Framework Transforming Diverse and Lengthy Tool Documentation into a Unified and Concise Tool Instruction for Easier Tool Usage

“Large Language Models (LLMs) are powerful in AI but face challenges in efficiently using external tools. To address this, researchers introduce the ‘EASY TOOL’ framework, streamlining tool documentation for LLMs. It restructures, simplifies, and enhances tool…

AI Tech News
Can You Turn Your Vision-Language Model from a Zero-Shot Model to Any-Shot Generalist? Meet LIxP, the Context-Aware Multimodal Framework

Understanding Contrastive Language-Image Pretraining What is Contrastive Language-Image Pretraining? Contrastive language-image pretraining is a cutting-edge AI method that allows models to effectively connect images and text. This technique helps models understand the differences between unrelated data…

AI Tech News
CMU Researchers Propose XEUS: A Cross-lingual Encoder for Universal Speech trained in 4000+ Languages

Practical Solutions for Multilingual Speech Processing Introducing XEUS: A Cross-lingual Encoder for Universal Speech Self-supervised learning (SSL) has expanded the reach of speech technologies to many languages by minimizing the need for labeled data. However, current…

AI Tech News
Meet MiniChain: A Tiny Python Library for Coding with Large Language Models

MiniChain, a compact Python library, revolutionizes prompt chaining for large language models (LLMs). It simplifies the process by encapsulating prompt chaining essence, offers streamlined annotation, visualizing chains, efficient state management, separation of logic and prompts, flexible…

AI Tech News
This AI Paper Introduces ROMAS: A Role-Based Multi-Agent System for Efficient Database Monitoring and Planning

Understanding Multi-Agent Systems (MAS) Multi-agent systems (MAS) are crucial in artificial intelligence as they enable different agents to work together on complex tasks. They are especially useful in changing environments where they can assist with data…

AI Tech News
Meet EvaByte: An Open-Source 6.5B State-of-the-Art Tokenizer-Free Language Model Powered by EVA

Understanding Tokenization Challenges Tokenization breaks text into smaller parts, which is essential in natural language processing (NLP). However, it has several challenges: Struggles with multilingual text and out-of-vocabulary (OOV) words. Issues with typos, emojis, and mixed-code…

AI Tech News
Methods for generating synthetic descriptive data

The article explains methods for generating synthetic descriptive data in PySpark. It covers various sources for creating textual data, including random characters, APIs, third-party packages like Faker, and using Large Language Models (LLMs) such as ChatGPT.…

AI Tech News
AI-Generated Profile Pictures Could Get You a Job But At What Cost?

AI-driven apps are becoming popular for enhancing professional online images. Apps like Remini, Try It On AI, and AI Suit Up use artificial intelligence to create polished profile photos. While some users find these images to…

AI Tech News
IBM Researchers ACPBench: An AI Benchmark for Evaluating the Reasoning Tasks in the Field of Planning

Understanding LLMs and Their Role in Planning Large Language Models (LLMs) are becoming increasingly important as various industries explore artificial intelligence for better planning and decision-making. These models, particularly generative and foundational ones, are essential for…

AI Tech News
CodePMP: A Scalable Preference Model Pre-training for Supercharging Large Language Model Reasoning

Practical AI Solutions for Improving Large Language Model Reasoning Challenge in Enhancing LLMs’ Reasoning Abilities Enhancing reasoning abilities of Large Language Models (LLMs) for complex logical and mathematical tasks remains a challenge due to the lack…

AI Tech News
A New Study from Korea Introduces a Deep Learning-Based Approach to Screen for Autism and Symptom Severity Using Retinal Photographs

A recent study introduces a potential game-changer in diagnosing autism spectrum disorder (ASD) by utilizing retinal photographs and advanced deep-learning algorithms. The study showcases outstanding performance metrics, with the algorithms accurately distinguishing between individuals with ASD…

AI Tech News
Advancements in Knowledge Distillation and Multi-Teacher Learning: Introducing AM-RADIO Framework

Advancements in Knowledge Distillation and Multi-Teacher Learning: Introducing AM-RADIO Framework Knowledge Distillation has become a prominent technique for transferring knowledge from a “teacher” to a smaller “student” model, surpassing the teacher’s performance. This approach has extended…

AI Tech News
RABBITS: A Specialized Dataset and Leaderboard to Aid in Evaluating LLM Performance in Healthcare

AI Solutions for Biomedical NLP Enhancing Healthcare Delivery and Clinical Decision-Making Biomedical natural language processing (NLP) utilizes machine learning models to interpret medical texts, improving diagnostics, treatment recommendations, and medical information extraction. Challenges in Biomedical NLP…

AI Tech News
Geometry Distributions: Advancing Neural 3D Surface Modeling with Diffusion Models

Understanding Geometry Representations in 3D Vision Geometry representations are essential for addressing complex 3D vision challenges. With advancements in deep learning, there’s a growing focus on creating data structures that work well with neural networks. Coordinate…

AI Tech News
Meta AI Releases Llama Guard 3-1B-INT4: A Compact and High-Performance AI Moderation Model for Human-AI Conversations

Transforming Human-Technology Interaction with Generative AI Overview of Generative AI Generative AI is changing the way we interact with technology. It offers powerful tools for natural language processing and content creation. However, there are risks, such…

AI Tech News
Mixture of Experts and Sparsity – Hot AI topics explained

The release of smaller, more efficient AI models like Mistral’s Mixtral 8x7B has sparked interest in “Mixture of Experts” (MoE) and “Sparsity.” MoE breaks models into specialized “experts,” reducing training time and enhancing speed. Sparsity involves…

AI Tech News