feat: Create Spatial Understanding Notebook for Gemini 2.0 #1651

holtskinner · 2025-01-23T21:48:14Z

Based on https://github.com/google-gemini/cookbook/blob/main/gemini-2/spatial_understanding.ipynb

review-notebook-app · 2025-01-23T21:48:20Z

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

code-review-assist

Hi @holtskinner and reviewers,

I'm currently reviewing this pull request and will post my feedback in a few minutes. In the meantime, I'm providing this summary of the pull request's content to help you and other reviewers quickly understand the changes and intent.

This pull request, titled "feat: Create Spatial Understanding Notebook for Gemini 2.0", aims to create a Jupyter Notebook demonstrating spatial understanding capabilities with the Gemini 2.0 API in Vertex AI. The notebook leverages the Google Gen AI SDK and the gemini-2.0-flash-exp model.

Here's a summary of the changes:

A new Jupyter Notebook (gemini/use-cases/spatial-understanding/spatial_understanding.ipynb) was added. This notebook is approximately 675 lines long and contains comprehensive examples of spatial understanding tasks. The notebook includes sections on:
- Installation and authentication
- Setting up the Google Cloud project
- Importing necessary libraries
- Loading the Gemini model
- Defining helper functions for plotting bounding boxes
- Several examples demonstrating object detection and spatial reasoning with various images and prompts.
- Sharing options for the notebook (Colab, Colab Enterprise, Vertex AI Workbench, GitHub).
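The plotting helpers themselves are not reproduced in this thread. As a rough sketch only, assuming each box arrives as `[y_min, x_min, y_max, x_max]` normalized to a 0-1000 range (an assumption here, not something this summary confirms), converting one box to pixel coordinates before drawing might look like the following. The function name `to_pixel_box` is hypothetical.

```python
def to_pixel_box(box_2d, width, height, scale=1000):
    """Convert a normalized box to pixel coordinates.

    Assumes box_2d is [y_min, x_min, y_max, x_max] on a 0..scale grid
    (an assumption for illustration). Returns (left, top, right, bottom).
    """
    y_min, x_min, y_max, x_max = box_2d
    # Integer arithmetic avoids floating-point rounding surprises.
    left = x_min * width // scale
    right = x_max * width // scale
    top = y_min * height // scale
    bottom = y_max * height // scale
    # Guard against a model returning the corners in swapped order.
    return (min(left, right), min(top, bottom),
            max(left, right), max(top, bottom))
```

The returned tuple is in the `(left, top, right, bottom)` order that most drawing APIs expect for rectangles, so it can be passed to whichever plotting library the notebook uses.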

The notebook uses a BoundingBox Pydantic model to structure the output and includes safety settings to mitigate potential risks. The code effectively demonstrates how to use the Gemini API for tasks such as:
- Simply overlaying information on images
- Searching within images for specific objects
- Leveraging Gemini's reasoning capabilities for tasks like finding the shadow of an object
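For readers who have not opened the notebook, the structured-output idea can be sketched as follows. The notebook uses a Pydantic model; this sketch swaps in a standard-library `dataclass` to stay dependency-free, and the field names `box_2d` and `label`, the JSON shape, and the helper `parse_boxes` are all assumptions for illustration rather than the notebook's actual code.

```python
import json
from dataclasses import dataclass


@dataclass
class BoundingBox:
    """Dataclass stand-in for the notebook's Pydantic model (fields assumed)."""
    box_2d: list[int]  # assumed order: [y_min, x_min, y_max, x_max]
    label: str


def parse_boxes(response_text: str) -> list[BoundingBox]:
    """Parse a JSON array of detections into BoundingBox objects."""
    boxes = []
    for item in json.loads(response_text):
        coords = [int(v) for v in item["box_2d"]]
        if len(coords) != 4:
            raise ValueError(f"expected 4 coordinates, got {len(coords)}")
        boxes.append(BoundingBox(box_2d=coords, label=str(item["label"])))
    return boxes
```

Constraining the response to a schema like this means the plotting code can rely on every detection having exactly four coordinates and a label, instead of parsing free-form text.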

The notebook also includes a YouTube video link demonstrating spatial understanding with Gemini 2.0.

I'll provide a more detailed review shortly, but this summary should give you a good overview of the changes.

Here's a little haiku to lighten the mood:

Boxes on the screen,
Gemini's sight, sharp and keen,
Code makes pictures speak.

code-review-assist

This pull request introduces a new Jupyter Notebook demonstrating spatial understanding with Gemini 2.0. The notebook is well-structured and provides clear explanations and examples. The code is generally clean and easy to follow. However, there are a few minor suggestions for improvement, primarily related to documentation and code style.

I've also noticed a few minor issues with the prompt text and some inconsistencies in the code. I've added comments and suggestions to address these issues and improve the overall quality of the notebook.

Overall, this is a valuable addition to the repository. Addressing the minor issues I've raised would further enhance its clarity and usefulness.

gemini/use-cases/spatial-understanding/spatial_understanding.ipynb

feat: Create Spatial Understanding Notebook for Gemini 2.0

d09feb2

- Based on https://github.com/google-gemini/cookbook/blob/main/gemini-2/spatial_understanding.ipynb

holtskinner requested a review from a team as a code owner January 23, 2025 21:48

code-review-assist bot reviewed Jan 23, 2025

Remove smart quotes

27070e3

holtskinner assigned gericdong Jan 23, 2025

code-review-assist bot reviewed Jan 23, 2025

gemini/use-cases/spatial-understanding/spatial_understanding.ipynb (resolved)

gemini/use-cases/spatial-understanding/spatial_understanding.ipynb (outdated, resolved)
