Google’s Data Science Agent now live in Colab, powered by Gemini 2.0

· 1 min read
Gemini

Google has announced the release of the Data Science Agent for all Google Colab users, marking a significant step in automating data analysis workflows. This feature, previously available to a limited group of trusted testers, is now accessible globally, allowing users to generate fully functional Colab notebooks using natural language descriptions. The tool is powered by Gemini 2.0, Google's advanced AI model.

The Data Science Agent simplifies the process of creating and running Python-based data analysis projects in Colab. Users can upload their datasets, describe their analysis objectives—such as visualizing trends, optimizing prediction models, or selecting statistical techniques—and let the agent generate complete notebooks. These notebooks include all necessary code, library imports, and analysis steps, which users can further modify or share using Colab's collaboration features.

Gemini

This release builds on feedback from testers who praised the agent's ability to generate concise and high-quality code while correcting errors. It has already been used in research environments like the Climate Department at Lawrence Berkeley National Laboratory, where it saved significant time on greenhouse gas data processing. Additionally, the Data Science Agent ranked 4th on HuggingFace's DABStep benchmark for multi-step reasoning, outperforming several other prominent AI agents.

To explore the feature, users can experiment with sample datasets such as the Stack Overflow Developer Survey or Iris Species dataset by providing simple prompts like "Visualize most popular programming languages" or "Train a random forest classifier." This functionality is designed to reduce setup time and allow users to focus on deriving insights from their data.

The feature is now live in Google Colab and aims to streamline data workflows for students, researchers, and professionals alike. Users can also join the Google Labs Discord server to share feedback and connect with other users in the #data-science-agent channel.