The Amazon SageMaker JumpStart SDK has been simplified for building, training, and deploying foundation models, and the prediction code is now easier to use. This post demonstrates how to get started with foundation models using the simplified SageMaker JumpStart SDK in just a few lines of code. SageMaker JumpStart provides pre-trained models, solution templates, and example notebooks for various problem types, accessible through the SageMaker JumpStart landing page or the SageMaker Python SDK. The post walks through deploying and invoking models, fine-tuning them on your own data, and customizing the new classes in the SageMaker SDK for specific requirements.
Simplified SageMaker JumpStart SDK for Building, Training, and Deploying Models
We are excited to introduce a simplified version of the Amazon SageMaker JumpStart SDK that makes it easy to build, train, and deploy foundation models. With just a few lines of code, you can get started with using pre-trained models and fine-tune them for your specific tasks.
Solution Overview
SageMaker JumpStart provides pre-trained, open-source models for various problem types, allowing you to quickly start your machine learning (ML) projects. You can incrementally train and fine-tune these models before deployment. JumpStart also offers solution templates and example notebooks for common ML use cases.
To demonstrate the capabilities of the new SageMaker JumpStart SDK, we show you how to use the pre-trained Flan-T5 Base model from Hugging Face for text generation and summarization tasks. You can also use other models such as Llama 2, Falcon, or Mistral for text generation.
To deploy the model, you use the model ID provided for each pre-trained model. In this example, we use the model ID for the Flan-T5 Base model (huggingface-text2text-flan-t5-base). After deployment, you can invoke the model to generate summaries of text.
Deploy and Invoke the Model
To deploy the Flan-T5 Base model, you can use the simplified SageMaker JumpStart SDK. Simply instantiate the model object with the model ID and call the deploy method. Here’s an example:
from sagemaker.jumpstart.model import JumpStartModel

# Create the model object from its JumpStart model ID and deploy an endpoint
pretrained_model = JumpStartModel(model_id="huggingface-text2text-flan-t5-base")
pretrained_predictor = pretrained_model.deploy()
Once the model is deployed, you can invoke it by passing the text to the predictor. The response from the model will be returned as a Python dictionary. Here’s an example:
text = "Summarize this content - Amazon Comprehend uses natural language processing (NLP) to extract insights about the content of documents..."
query_response = pretrained_predictor.predict(text)
print(query_response["generated_text"])
This will generate a summary of the provided text using the Flan-T5 Base model.
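Beyond a plain string, many JumpStart text2text models also accept a JSON payload carrying generation parameters. The field names below (text_inputs, max_length, temperature) are assumptions for illustration; check the input schema documented for your chosen model. A minimal sketch of assembling such a payload:

```python
# Hedged sketch: assemble a text2text payload with optional generation
# parameters. Field names are assumptions - verify them against your
# model's documented input schema.
def build_payload(text, max_length=200, temperature=0.7):
    """Return a JSON-serializable payload for a text2text endpoint."""
    return {
        "text_inputs": text,
        "max_length": max_length,
        "temperature": temperature,
    }

payload = build_payload("Summarize this content - Amazon Comprehend uses NLP...")
# query_response = pretrained_predictor.predict(payload)  # requires a deployed endpoint
```

Tuning parameters like these lets you trade summary length and creativity against determinism without redeploying the endpoint.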
Fine-tune and Deploy the Model
The SageMaker JumpStart SDK also provides a JumpStartEstimator class for simplified fine-tuning. You can provide the location of your fine-tuning data and optionally pass validation datasets. After fine-tuning, you can deploy the model using the deploy method of the Estimator object. Here’s an example:
from sagemaker.jumpstart.estimator import JumpStartEstimator

model_id = "huggingface-text2text-flan-t5-base"
estimator = JumpStartEstimator(
    model_id=model_id,
)
estimator.set_hyperparameters(instruction_tuned="True", epoch="3", max_input_length="1024")
# train_data_location is the S3 URI of your fine-tuning dataset, e.g. "s3://your-bucket/train/"
estimator.fit({"training": train_data_location})
finetuned_predictor = estimator.deploy()
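The fine-tuned predictor is invoked the same way as the pre-trained one. As a small convenience, the response handling can be wrapped in a helper; this sketch assumes the generated_text response key shown earlier, and allows for model versions that wrap the response in a list:

```python
def extract_summary(query_response):
    """Pull the generated text out of a text2text model response.

    Assumes the "generated_text" response key shown earlier; some model
    versions return a list of such dicts, which is handled here too.
    """
    if isinstance(query_response, list):
        query_response = query_response[0]
    return query_response["generated_text"]

# summary = extract_summary(finetuned_predictor.predict(text))  # live endpoint needed
```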
Customize the New Classes in the SageMaker SDK
The new SDK allows you to customize the deployment and invocation based on your requirements. You can override the defaults and customize parameters such as instance type, VPC configuration, and more. Here’s an example of overriding the instance type:
finetuned_predictor = estimator.deploy(instance_type='ml.g5.2xlarge')
You can also customize the input payload format type using serializers and content types. Here’s an example of setting the payload input format as JSON:
from sagemaker import serializers

pretrained_predictor.serializer = serializers.JSONSerializer()
pretrained_predictor.content_type = 'application/json'
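Under the hood, JSONSerializer encodes a dict payload into a JSON string before it is sent to the endpoint, equivalent to json.dumps. A quick stdlib-only illustration (the text_inputs field name is an assumption for a text2text model):

```python
import json

# What a JSON serializer produces for a dict payload: a JSON string body.
payload = {"text_inputs": "Summarize this content - ..."}
body = json.dumps(payload)

# The endpoint receives this string with Content-Type: application/json;
# decoding it recovers the original payload.
assert json.loads(body) == payload
```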
Conclusion
The simplified SageMaker JumpStart SDK makes it easy to build, train, and deploy models with just a few lines of code. You can use pre-trained models or fine-tune them for your specific tasks. The SDK also allows for customization to meet your requirements. Explore the available models and start leveraging AI to redefine your work processes.
About the Authors
Evan Kravitz, Rachna Chadha, Jonathan Guinegagne, and Dr. Ashish Khetan are experts in the field of AI and ML, with experience in developing and applying machine learning algorithms. They are part of the Amazon SageMaker JumpStart team, working to make AI accessible and impactful.