Gemma (language model)


Gemma
Developer(s): Google DeepMind
Initial release: February 21, 2024[1]
Stable release: Gemma 3 / March 12, 2025[2]
Type: Large language model
License: Gemma License
Website: deepmind.google/models/gemma/

Gemma is a series of open-source large language models developed by Google DeepMind. It is based on technology similar to that of Gemini. The first version was released in February 2024, followed by Gemma 2 in June 2024 and Gemma 3 in March 2025. Variants of Gemma have also been developed, such as the vision-language model PaliGemma and DolphinGemma, a model for studying dolphin communication.

History

In February 2024, Google debuted Gemma, a family of free and open-source LLMs that serve as a lightweight version of Gemini. The models come in two sizes, with two billion and seven billion parameters, respectively. Multiple publications viewed this as a response to Meta and others open-sourcing their AI models, and as a stark reversal from Google's longstanding practice of keeping its AI proprietary.[3][4][5]

Gemma 2 was released on June 27, 2024,[6] and Gemma 3 was released on March 12, 2025.[2][7]

Overview

Based on technology similar to that of the Gemini series of models, Gemma is described by Google as helping support its mission of "making AI helpful for everyone."[8] Google offers official Gemma variants optimized for specific use cases, such as MedGemma for medical analysis and DolphinGemma for studying dolphin communication.[9]

Since its release, Gemma models have had over 150 million downloads, with 70,000 variants available on Hugging Face.[10]

The latest generation of models is Gemma 3, offered in 1, 4, 12, and 27 billion parameter sizes with support for over 140 languages. As multimodal models, they accept both text and image input.[11] Google also offers Gemma 3n, a family of smaller models optimized for execution on consumer devices like phones, laptops, and tablets.[12]
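As open-weight models, Gemma checkpoints can be downloaded and run locally. A minimal usage sketch with the Hugging Face transformers library follows; the model identifier "google/gemma-3-1b-it" and the chat-style input are assumptions based on the hub's naming conventions, not details from this article:

    # Minimal sketch: running an open-weight Gemma 3 checkpoint with the
    # Hugging Face transformers library. The model id "google/gemma-3-1b-it"
    # is an assumption based on hub naming conventions; the hub also requires
    # accepting the Gemma License before the weights can be downloaded.
    from transformers import pipeline

    generator = pipeline("text-generation", model="google/gemma-3-1b-it")
    messages = [{"role": "user", "content": "Summarize what a context window is."}]
    result = generator(messages, max_new_tokens=128)
    print(result[0]["generated_text"])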

Architecture

The latest version of Gemma, Gemma 3, is based on a decoder-only transformer architecture with grouped-query attention (GQA) and the SigLIP vision encoder. Every model has a context length of 128K tokens, with the exception of Gemma 3 1B, which has a context length of 32K.[13]
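In grouped-query attention, several query heads share each key/value head, which shrinks the key/value cache relative to standard multi-head attention. A minimal NumPy sketch is below; the head counts and dimensions are illustrative, not Gemma 3's actual configuration:

    # Minimal sketch of grouped-query attention (GQA). Head counts and
    # dimensions are illustrative, not Gemma 3's published configuration.
    import numpy as np

    def softmax(x):
        x = x - x.max(axis=-1, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=-1, keepdims=True)

    def grouped_query_attention(q, k, v):
        """q: (num_q_heads, seq, dim); k and v: (num_kv_heads, seq, dim)."""
        num_q_heads, _, dim = q.shape
        group = num_q_heads // k.shape[0]   # query heads per shared KV head
        k = np.repeat(k, group, axis=0)     # expand each KV head to its group
        v = np.repeat(v, group, axis=0)
        scores = q @ k.transpose(0, 2, 1) / np.sqrt(dim)
        return softmax(scores) @ v          # (num_q_heads, seq, dim)

    rng = np.random.default_rng(0)
    q = rng.standard_normal((8, 16, 32))    # 8 query heads
    k = rng.standard_normal((2, 16, 32))    # 2 KV heads, each shared 4 ways
    v = rng.standard_normal((2, 16, 32))
    print(grouped_query_attention(q, k, v).shape)   # (8, 16, 32)

With two KV heads serving eight query heads, the KV cache is a quarter the size it would be under multi-head attention, at the cost of less expressive key/value projections.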

Quantized versions fine-tuned using quantization-aware training (QAT) are also available,[13] offering sizable memory usage improvements with some negative impact on accuracy and precision.[14]
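The core mechanism of QAT is often described as "fake quantization": during fine-tuning, weights are rounded to the low-precision grid they will use at inference, so the model learns to tolerate the rounding error. A minimal sketch of that rounding step, assuming symmetric per-tensor scaling (the report's exact scheme may differ):

    # Minimal sketch of the "fake quantization" step used in quantization-
    # aware training: quantize to an integer grid, then dequantize, so the
    # forward pass sees the rounding error. Symmetric per-tensor scaling is
    # an assumption; real schemes vary.
    import numpy as np

    def fake_quantize(w, num_bits=8):
        qmax = 2 ** (num_bits - 1) - 1            # e.g. 127 for int8
        scale = np.abs(w).max() / qmax            # per-tensor scale factor
        q = np.clip(np.round(w / scale), -qmax, qmax)
        return q * scale                          # float again, grid-snapped

    w = np.random.default_rng(0).standard_normal(5)
    print(w)
    print(fake_quantize(w, num_bits=8))   # close to w: 255 levels
    print(fake_quantize(w, num_bits=4))   # coarser grid, larger error

During training, the non-differentiable rounding is typically bypassed with a straight-through estimator so gradients can still flow to the underlying float weights.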

Variants

Google develops official variants of Gemma models designed for specific purposes, like medical analysis or programming. These include:

  • ShieldGemma 2 (4B): Based on the Gemma 3 family, ShieldGemma is designed to identify and filter violent, dangerous, and sexually explicit images.[15]
  • MedGemma (4B and 27B): Also based on Gemma 3, MedGemma is designed for medical applications like image analysis. However, Google also notes that MedGemma "isn't yet clinical grade."[16]
  • DolphinGemma (roughly 400M): Developed in collaboration with researchers at Georgia Tech and the Wild Dolphin Project, DolphinGemma aims to better understand dolphin communication through audio analysis.[17][18]
  • CodeGemma (2B and 7B): CodeGemma is a family of models designed for code completion and general coding tasks.[19] It supports multiple programming languages, including Python, Java, and C++.[20] A sketch of the fill-in-the-middle prompting style used for code completion follows this list.
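Code-completion models such as CodeGemma are typically prompted in a fill-in-the-middle (FIM) style: the code before and after the cursor is supplied, and the model generates the missing span. A minimal sketch follows; the sentinel token strings are assumptions and should be checked against the CodeGemma model card:

    # Sketch of a fill-in-the-middle (FIM) prompt for a code-completion
    # model. The sentinel strings are assumptions; verify the exact tokens
    # against the CodeGemma model card before use.
    prefix = "def mean(xs):\n    "
    suffix = "\n    return total / len(xs)\n"
    prompt = f"<|fim_prefix|>{prefix}<|fim_suffix|>{suffix}<|fim_middle|>"
    # The model would be expected to complete the gap with something like:
    #     total = sum(xs)
    print(prompt)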
Technical specifications of Gemma models

Generation | Release date | Parameters | Context length | Multimodal | Notes
Gemma 1 | 21 February 2024 | 2B, 7B | 8,192 | No | 2B distilled from 7B. 2B uses multi-query attention while 7B uses multi-head attention.
CodeGemma | | 2B, 7B | 8,192 | No | Gemma 1 fine-tuned for code generation.
RecurrentGemma | 11 April 2024 | 2B, 9B | Unlimited (trained on 8,192) | No | Griffin-based instead of Transformer-based.[21]
Gemma 2 | 27 June 2024 | 2B, 9B, 27B | 8,192 | No | 27B trained on web documents, code, and science articles. 9B distilled from 27B. 2B distilled from an unreleased 7B model. Uses grouped-query attention.[22]
PaliGemma | 10 July 2024 | 3B | 8,192 | Image | Vision-language model that takes text and image inputs and outputs text; connects a SigLIP-So400m image encoder to Gemma v1.0 2B.[23][24]
PaliGemma 2 | 4 December 2024 | 3B, 10B, 28B | 8,192 | Image | Connects SigLIP-So400m to Gemma v2.0 2B, 9B, and 27B; capable of more vision-language tasks.[25][26]
Gemma 3 | 12 March 2025 | 1B, 4B, 12B, 27B | 131,072 | Image | All models trained with distillation. Post-training focuses on math, coding, chat, instruction following, and multilingual ability (supports 140 languages). Capable of function calling. 1B is not capable of vision.[27]

Note: open-weight models can have their context length rescaled at inference time. With Gemma 1, Gemma 2, PaliGemma, and PaliGemma 2, the cost is a KV-cache that grows linearly with the context window. Gemma 3 has an improved growth curve due to its separation of local and global attention, and RecurrentGemma's memory use is unchanged after 2,048 tokens.
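That linear growth can be made concrete with a back-of-the-envelope calculation; the layer and head counts below are illustrative assumptions, not Gemma's published configuration:

    # Back-of-the-envelope KV-cache sizing for a decoder-only transformer.
    # All hyperparameters are illustrative assumptions, not Gemma's
    # published configuration.
    def kv_cache_bytes(layers, kv_heads, head_dim, context, bytes_per_elem=2):
        # 2x for keys and values; one entry per layer, KV head, and position.
        return 2 * layers * kv_heads * head_dim * context * bytes_per_elem

    for ctx in (8_192, 131_072):
        gib = kv_cache_bytes(32, 8, 128, ctx) / 2**30
        print(f"{ctx:>7} tokens -> {gib:.1f} GiB")  # 1.0 GiB vs 16.0 GiB

Because Gemma 3 interleaves local attention layers, which cache only a short window, with occasional global layers, its aggregate cache grows more slowly than this worst case.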

References

  1. Banks, Jeanine; Warkentin, Tris. "Gemma: Introducing new state-of-the-art open models". The Keyword. Retrieved 16 August 2025.
  2. "Introducing Gemma 3: The most capable model you can run on a single GPU or TPU". The Keyword. March 12, 2025.
  3. Khan, Jeremy (February 21, 2024). "Google unveils new family of open-source AI models called Gemma to take on Meta and others—deciding open-source AI ain't so bad after all". Fast Company. Archived from the original on February 21, 2024. Retrieved February 21, 2024.
  4. Alba, Davey (February 21, 2024). "Google Delves Deeper Into Open Source with Launch of Gemma AI Model". Bloomberg News. Archived from the original on February 21, 2024. Retrieved February 21, 2024.
  5. Metz, Cade; Grant, Nico (February 21, 2024). "Google Is Giving Away Some of the A.I. That Powers Chatbots". The New York Times. ISSN 0362-4331. Archived from the original on February 21, 2024. Retrieved February 21, 2024.
  6. "Gemma 2 is now available to researchers and developers". Google. 2024-06-27. Retrieved 2024-08-15.
  7. "Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM". Hugging Face. March 12, 2025.
  8. Banks, Jeanine; Warkentin, Tris (February 21, 2024). "Gemma: Introducing new state-of-the-art open models". The Keyword. Retrieved 13 July 2025.
  9. "Gemma - Google DeepMind". Google DeepMind. Retrieved 13 July 2025.
  10. Wiggers, Kyle (May 12, 2025). "Google's Gemma AI models surpass 150M downloads". TechCrunch. Retrieved 13 July 2025.
  11. Gosthipaty, Aritra; merve; Cuenca, Pedro; Srivastav, Vaibhav (March 12, 2025). "Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM". Hugging Face. Retrieved 13 July 2025.
  12. "Gemma 3n model overview". Google AI for Developers. Retrieved 13 July 2025.
  13. Gemma Team (2025). "Gemma 3 Technical Report". arXiv:2503.19786v1 [cs.CL].
  14. Clark, Bryan (May 15, 2025). "What is quantization aware training?". IBM. Retrieved 14 July 2025.
  15. ShieldGemma Team (2025). "ShieldGemma 2: Robust and Tractable Image Content Moderation". arXiv:2504.01081 [cs.CV].
  16. "MedGemma". Google Health AI Developer Foundations. Retrieved 15 July 2025.
  17. "DolphinGemma: How Google AI is helping decode dolphin communication". Georgia Tech. Retrieved 15 July 2025.
  18. Herzing, Denise; Starner, Thad (April 14, 2025). "DolphinGemma: How Google AI is helping decode dolphin communication". The Keyword. Retrieved 15 July 2025.
  19. Irwin, Kate (April 10, 2024). "Google Launches Coding AIs That Could Rival Microsoft's GitHub Copilot". PCMag. Retrieved 15 July 2025.
  20. "CodeGemma". Google AI for Developers. Retrieved 15 July 2025.
  21. "RecurrentGemma: Moving Past Transformers for Efficient Open Language Models". arxiv.org.
  22. Gemma Team; Riviere, Morgane; Pathak, Shreya; Sessa, Pier Giuseppe; Hardin, Cassidy; Bhupatiraju, Surya; Hussenot, Léonard; Mesnard, Thomas; Shahriari, Bobak (2024-08-02), Gemma 2: Improving Open Language Models at a Practical Size, arXiv:2408.00118
  23. "PaLI: Scaling Language-Image Learning in 100+ Languages". research.google. Retrieved 2024-08-15.
  24. "PaliGemma: A versatile 3B VLM for transfer". arxiv.org. 2024-07-10.
  25. "Introducing PaliGemma 2 mix: A vision-language model for multiple tasks- Google Developers Blog". developers.googleblog.com. Retrieved 2025-02-22.
  26. "PaliGemma 2: A Family of Versatile VLMs for Transfer". arxiv.org.
  27. "Gemma 3 Technical Report". arxiv.org.