기본 콘텐츠로 건너뛰기

Smaller, Safer, More Transparent: Advancing Responsible AI with Gemma

 Smaller, Safer, More Transparent: Advancing Responsible AI with Gemma





In June, we released Gemma 2, our new best-in-class open models, in 27 billion (27B) and 9 billion (9B) parameter sizes. Since its debut, the 27B model quickly became one of the highest-ranking open models on the LMSYS Chatbot Arena leaderboard, even outperforming popular models more than twice its size in real conversations.

But Gemma is about more than just performance. It's built on a foundation of responsible AI, prioritizing safety and accessibility. To support this commitment, we are excited to announce three new additions to the Gemma 2 family:

  1. Gemma 2 2B – a brand-new version of our popular 2 billion (2B) parameter model, featuring built-in safety advancements and a powerful balance of performance and efficiency.

2. ShieldGemma – a suite of safety content classifier models, built upon Gemma 2, to filter the input and outputs of AI models and keep the user safe.

3. Gemma Scope – a new model interpretability tool that offers unparalleled insight into our models' inner workings.

With these additions, researchers and developers can now create safer customer experiences, gain unprecedented insights into our models, and confidently deploy powerful AI responsibly, right on device, unlocking new possibilities for innovation.


Gemma 2 2B: Experience Next-Gen Performance, Now On-Device

We're excited to introduce the Gemma 2 2B model, a highly anticipated addition to the Gemma 2 family. This lightweight model produces outsized results by learning from larger models through distillation. In fact, Gemma 2 2B surpasses all GPT-3.5 models on the Chatbot Arena, demonstrating its exceptional conversational AI abilities.

Graph - LYMSYS Chatbot Arena leaderboard scores
LMSYS Chatbot Arena leaderboard scores captured on July 30th, 2024. Gemma 2 2B score +/- 10.

Gemma 2 2B offers:

  • Exceptional performance: Delivers best-in-class performance for its size, outperforming other open models in its category.

  • Flexible and cost-effective deployment: Run Gemma 2 2B efficiently on a wide range of hardware—from edge devices and laptops to robust cloud deployments with Vertex AI and Google Kubernetes Engine (GKE). To further enhance its speed, it is optimized with the NVIDIA TensorRT-LLM library and is available as an NVIDIA NIM. This optimization targets various deployments, including data centers, cloud, local workstations, PCs, and edge devices — using NVIDIA RTX, NVIDIA GeForce RTX GPUs, or NVIDIA Jetson modules for edge AI. Additionally, Gemma 2 2B seamlessly integrates with Keras, JAX, Hugging Face, NVIDIA NeMo, Ollama, Gemma.cpp, and soon MediaPipe for streamlined development.

Starting today, you can download Gemma 2’s model weights from KaggleHugging FaceVertex AI Model Garden. You can also try its capabilities in Google AI Studio.


ShieldGemma: Protecting Users with State-of-the-Art Safety Classifiers

Deploying open models responsibly to ensure engaging, safe, and inclusive AI outputs requires significant effort from developers and researchers. To help developers in this process, we're introducing ShieldGemma, a series of state-of-the-art safety classifiers designed to detect and mitigate harmful content in AI models inputs and outputs. ShieldGemma specifically targets four key areas of harm:

  • Hate speech

  • Harassment

  • Sexually explicit content

  • Dangerous content

Generative AI application model architecture

These open classifiers complement our existing suite of safety classifiers in the Responsible AI Toolkit, which includes a methodology to build classifiers tailored to a specific policy with limited number of datapoints, as well as existing Google Cloud off-the-shelf classifiers served via API.


Here's how ShieldGemma can help you create safer, better AI applications:

  • SOTA performance: Built on top of Gemma 2, ShieldGemma are the industry-leading safety classifiers.

  • Flexible sizes: ShieldGemma offers various model sizes to meet diverse needs. The 2B model is ideal for online classification tasks, while the 9B and 27B versions provide higher performance for offline applications where latency is less of a concern. All sizes leverage NVIDIA speed optimizations for efficient performance across hardware.

  • Open and collaborative: The open nature of ShieldGemma encourages transparency and collaboration within the AI community, contributing to the future of ML industry safety standards.


"As AI continues to mature, the entire industry will need to invest in developing high performance safety evaluators. We're glad to see Google making this investment, and look forward to their continued involvement in our AI Safety Working Group.” ~ Rebecca Weiss, Executive Director, ML Commons
Evaluation results based on Optimal F1(left)/AU-PRC(right), higher is better.
Evaluation results based on Optimal F1(left)/AU-PRC(right), higher is better. We use 𝛼=0 And T = 1 for calculating the probabilities. ShieldGemma (SG) Prompt and SG Response are our test datasets and OpenAI Mod/ToxicChat are external benchmarks. The performance of baseline models on external datasets is sourced from Ghosh et al. (2024); Inan et al. (2023).

Learn more about ShieldGemma, see full results in the technical report, and start building safer AI applications with our comprehensive Responsible Generative AI Toolkit.


Gemma Scope: Illuminating AI Decision-Making with Open Sparse Autoencoders

Gemma Scope offers researchers and developers unprecedented transparency into the decision-making processes of our Gemma 2 models. Acting like a powerful microscope, Gemma Scope uses sparse autoencoders (SAEs) to zoom in on specific points within the model and make its inner workings more interpretable.

These SAEs are specialized neural networks that help us unpack the dense, complex information processed by Gemma 2, expanding it into a form that's easier to analyze and understand. By studying these expanded views, researchers can gain valuable insights into how Gemma 2 identifies patterns, processes information, and ultimately makes predictions. With Gemma Scope, we aim to help the AI research community discover how to build more understandable, accountable, and reliable AI systems.

Here's what makes Gemma Scope groundbreaking:

  • Interactive demos: Explore SAE features and analyze model behavior without writing code on Neuronpedia.

Learn more about Gemma Scope on the Google DeepMind blogtechnical report, and developer documentation.


A Future Built on Responsible AI

These releases represent our ongoing commitment to providing the AI community with the tools and resources needed to build a future where AI benefits everyone. We believe that open access, transparency, and collaboration are essential for developing safe and beneficial AI.


Get Started Today:

  • Try Gemma Scope on Neuronpedia and uncover the inner workings of Gemma 2.

Join us on this exciting journey towards a more responsible and beneficial AI future!

댓글

이 블로그의 인기 게시물

Non-contact exposure to dinotefuran disrupts honey bee homing by altering MagR and Cry2 gene expression

  Non-contact exposure to dinotefuran disrupts honey bee homing by altering  MagR  and  Cry2  gene expression Dinotefuran is known to negatively affect honeybee ( Apis mellifera ) behavior, but the underlying mechanism remains unclear. The magnetoreceptor ( MagR , which responds to magnetic fields) and cryptochrome ( Cry2 , which is sensitive to light) genes are considered to play important roles in honey bees’ homing and localization behaviors. Our study found that dinotefuran, even without direct contact, can act like a magnet, significantly altering  MagR  expression in honeybees. This non-contact exposure reduced the bees’ homing rate. In further experiments, we exposed foragers to light and magnetic fields, the  MagR  gene responded to magnetic fields only in the presence of light, with  Cry 2 playing a key switching role in the magnetic field receptor mechanism ( MagR–Cry2 ). Yeast two-hybrid and BiFc assays confirmed an interactio...

“Global honey crisis”: Testing technology and local sourcing soars amid fraud and tampering concerns

  “Global honey crisis”: Testing technology and local sourcing soars amid fraud and tampering concerns The World Beekeeping Awards will not grant a prize for honey next year due to the “inability” to thoroughly test honey for adulteration. The announcement comes amid the rise of honey fraud in the EU, where a 2023 investigation found that 46% of 147 honey samples tested were likely contaminated with low-cost plant syrups.  Apimondia, the International Federation of Beekeepers’ Associations, organizes the event at its Congress, whose 49th edition will be held in Copenhagen, Denmark, in September 2025. The conference brings together beekeepers, scientists and other stakeholders. “We will celebrate honey in many ways at the Congress, but honey will no longer be a category, and thus, there will be no honey judging in the World Beekeeping Awards. The lessons learned from Canada 2019 and Chile 2023 were that adequate testing was impossible if we are to award winning honey at the Con...

Unveiling the Canopy's Secrets: New Bee Species Discovered in the Pacific

  Unveiling the Canopy's Secrets: New Bee Species Discovered in the Pacific In an exciting development for environmentalists and beekeeping experts, researchers have discovered eight new species of masked bees in the Pacific Islands, shining a light on the rich biodiversity hidden within the forest canopy. This discovery underscores the critical role bees play in our ecosystems and the pressing need for conservation efforts to protect these vital pollinators. A New Frontier in Bee Research By exploring the forest canopy, scientists have opened a new frontier in bee research, revealing species that have adapted to life high above the ground. These discoveries are crucial for understanding the complex relationships between bees, flora, and the broader ecosystem. The new species of masked bees, characterized by their striking black bodies with yellow or white highlights, particularly on their faces, rely exclusively on the forest canopy for survival. The Importance of Bee Conservation...

New Report – Interlocked: Midwives and the Climate Crisis

New Report – Interlocked: Midwives and the Climate Crisis Earlier this year, midwives from 41 countries shared their experiences of working in communities affected by climate change through our survey, Midwives’ Experiences and Perspectives on Climate Change. Their voices shaped our new report, Interlocked: Midwives and the Climate Crisis , which highlights how midwives are already responding to the health impacts of climate disasters like floods, wildfires, and extreme heat—and why they must be included in climate action plans. What did we learn?Climate change is damaging community health: 75% of midwives reported that climate change is harming the communities they serve, with rising rates of preterm births, food insecurity, and restricted access to care during disasters like floods. Midwives are critical first responders: Midwives are often the first and only healthcare providers on the ground in crises, delivering care during wildfires, floods, and extreme heat. Midwives face signi...

Bee attack claims life of newspaper distributor

  Bee attack claims life of newspaper distributor Newspaper distributor Pushparaja Shetty (45), who sustained severe injuries in a bee attack, succumbed to his injuries on Thursday at a hospital in Mangaluru. Pushparaja was attacked by a swarm of bees on Wednesday morning while walking at Kenjaru Taangadi under Bajpe town panchayat limits. He was immediately admitted to a hospital for treatment but could not survive the ordeal. Fondly known as ‘Boggu’ in the Porkodi area, Pushparaja was well-known for his dedication to delivering newspapers on foot to every household. He was admired for his generosity, as he often distributed sweets to schoolchildren on Independence Day using his own earnings and contributed part of his income to the betterment of society. Pushparaja was unmarried and is survived by three brothers and one sister.

Start the New Year Humming Like a Bee

  Start the New Year Humming Like a Bee There are lots of opportunities to be as busy as a bee during these winter holidays. As we hustle toward the dawn of the New Year, it can be hard to notice that the natural world is actually suggesting something different for us right now. We’re past the solstice, but the winter still stretches ahead, the days are still short and the nights long. We’re being invited into a quieter, more inner-focused time. The ancient yogis were all about this inner focus. In India, for example, the Upanishads, the Sanskrit writings that accompanied the development of Hinduism — and alongside it, yoga — beginning around 800 B.C.E., went deeper than earlier texts had into philosophy and questions of being. With the goals of increased inner awareness and higher consciousness, yoga was at that time not yet as focused on the body or on asanas, as it now can tend to be. But the yogis did develop many practices to try to open the way to those goals. They discovered...

The largest “killer hornets” in the world were exterminated in the US

  The largest “killer hornets” in the world were exterminated in the US The US informed that it had exterminated the worldʼs largest hornets, nicknamed "killer hornets" — they are capable of occupying a hive of honey bees in just 90 minutes, decapitating all its inhabitants and feeding their offspring to their own. This  was reported  by the Department of Agriculture in Washington. The hornets, which can reach five centimeters in length, were previously called Asian giant hornets, but in 2019 they were also spotted in Washington state near the Canadian border. In China, these insects killed 42 people and seriously injured 1,675. A dead northern giant hornet (below) next to a native bald hornet. According to experts, the hornets could have entered North America in plant pots or shipping containers. The hornet can sting through most beekeeper suits because it produces nearly seven times more venom than a honeybee and stings multiple times. Thatʼs why the Washington Departme...

From Classroom to Hive: Jeff Tech students experience sweet journey of honey making

  From Classroom to Hive: Jeff Tech students experience sweet journey of honey making The Courier Express has partnered with digital media arts students at Jeff Tech to highlight accomplishments and updates from the school. q q q REYNOLDSVILLE — The new “Intro to Agriculture” class, taught by advanced manufacturing instructor Perry Neal, has recently been buzzing throughout the halls of Jeff Tech. The course has been receiving positive feedback from both students and teachers. “It’s a great class. I love it,” said Jeff Tech student Jacob DeFoor. Student Kyle Lasher said, “I’m really considering getting bees of my own.” Intro to Agriculture is an 18-week course that starts with students learning anything and everything bees. They gather together to learn the process and safety procedures of making honey from scratch with locally-sourced honey bees. In class, students research pollination, foods that contain honey, where to purchase hive equipment, types of bees, etc., according to N...

The Unexpected Surge: America's Honeybees Buzz Back to Record Numbers

The Unexpected Surge: America's Honeybees Buzz Back to Record Numbers In an age where environmental narratives often lean towards loss and decline, the story of the American honeybee offers a glimmer of hope and a puzzle to solve. Recent data from the Census of Agriculture reveals an astonishing rebound in the honeybee population, now soaring to an all-time high of 3.8 million colonies. This revelation comes as a surprise against the backdrop of two decades marked by fears of colony collapse and the potential ramifications for ecosystems and agriculture. The surge in bee populations brings to light a series of questions and insights into the intertwined worlds of agriculture, conservation, and legislation. Central to this narrative is the state of Texas, where legislative changes have catalyzed a beekeeping boom by offering agricultural tax breaks to landowners cultivating honeybees. This policy shift, coupled with the entrepreneurial spirit of Texans and the essential role of bees...

Researchers use advanced robotics to study honeybee behaviour

  Researchers use advanced robotics to study honeybee behaviour Researchers from our top-rated Computer Science department have made significant advances in understanding honeybee behaviour through the use of innovative robotic technology. The study, published in the cover page of prestigious journal - Science Robotics, offers unprecedented insights into the daily activities of honeybee colonies, particularly focusing on the queen bee and her interactions with worker bees. Robotic system provides continuous monitoring The research team, led by Professor Farshad Arvin, developed a sophisticated robotic system capable of continuous, long-term observation of bee hives. This system employs two high-resolution cameras that work autonomously, tracking the queen bee's movements and mapping the contents of the honeycomb. This technology allows the researchers to collect data on bee behaviour 24 hours a day, seven days a week. Researchers say this level of continuous monitoring was previous...