Tag Archives: Gemini

Multimodal image attachment is now available for Gemini in Android Studio

Posted by Paris Hsu – Product Manager, Android Studio

At every stage of the development lifecycle, Gemini in Android Studio has become your AI-powered companion, making it easier to build high quality apps. We are excited to announce a significant expansion: Gemini in Android Studio now supports multimodal inputs, which lets you attach images directly to your prompts! This unlocks a wealth of new possibilities that improve team collaboration and UI development workflows.

You can try out this new feature by downloading the latest Android Studio canary. We’ve outlined a few use cases to try, but we’d love to hear what you think as we work through bringing this feature into future stable releases. Check it out:

Image attachment - a new dimension of interaction

We first previewed Gemini's multimodal capabilities at Google I/O 2024. This technology allows Gemini in Android Studio to understand simple wireframes, and transform them into working Jetpack Compose code.

You'll now find an image attachment icon in the Gemini chat window. Simply attach JPEG or PNG files to your prompts and watch Gemini understand and respond to visual information. We've observed that images with strong color contrasts yield the best results.

New “Attach Image File” icon in chat window
1.1 New “Attach Image File” icon in chat window

Example of multimodal response in chat
1.2 Example multimodal response in chat

We encourage you to experiment with various prompts and images. Here are a few compelling use cases to get you started:

    • Rapid UI prototyping and iteration: Convert a simple wireframe or high-fidelity mock of your app's UI into working code.
    • Diagram explanation and documentation: Gain deeper insights into complex architecture or data flow diagrams by having Gemini explain their components and relationships.
    • UI troubleshooting: Capture screenshots of UI bugs and ask Gemini for solutions.

Rapid UI prototyping and iteration

Gemini's multimodal support lets you convert visual designs into functional UI code. Simply upload your image and use a clear prompt. It works whether you're working from your own sketches or from a designer mockup.

Here’s an example prompt: "For this image provided, write Android Jetpack Compose code to make a screen that's as close to this image as possible. Make sure to include imports, use Material3, and document the code.” And then you can append any specific or additional instructions related to the image.

Example prompt: 'For this image provided, write Android Jetpack Compose code to make a screen that's as close to this image as possible. Make sure to include imports, use Material3, and document the code.'

Example of generating Compose code from high-fidelity mock using Gemini in Android Studio
2. Example of generating Compose code from high-fidelity mock using Gemini in Android Studio (code output)

For more complex UIs, refine your prompts to capture specific functionality. For instance, when converting a calculator mockup, adding "make the interactions and calculations work as you'd expect" results in a fully functional calculator:

Example prompt to convert a calculator mock up

Example of generating Compose code from high-fidelity mock using Gemini in Android Studio
3. Example of generating Compose code from wireframe via Gemini in Android Studio (code output)

Note: this feature provides an initial design scaffold. It’s a good “first draft” and your edits and adjustments will be needed. Common refinements include ensuring correct drawable imports and importing icons. Consider the generated code a highly efficient starting point, accelerating your UI development workflow.

Diagram explanation and documentation

With Gemini's multimodal capabilities, you can also try uploading an image of your diagram and ask for explanations or documentation.

Example prompt: Upload the Now in Android architecture diagram and say "Explain the components and data flow in this diagram" or “Write documentation about this diagram”.

Example of generating Compose code from high-fidelity mock using Gemini in Android Studio
4. Example of asking Gemini to help document the NowInAndroid architecture diagram

UI troubleshooting

Leverage Gemini's visual analysis to identify and resolve bugs quickly. Upload a screenshot of the problematic UI, and Gemini will analyze the image and suggest potential solutions. You can also include relevant code snippets for more precise assistance.

In the example below, we used Compose UI check and found that the button is stretched too wide in tablet screens, so we took a screenshot and asked Gemini for solutions - it was able to leverage the window size classes to provide the right fix.

Example of generating Compose code from high-fidelity mock using Gemini in Android Studio
5. Example of fixing UI bugs using Image Attachment (code output)

Download Android Studio today

Download the latest Android Studio canary today to try the new multimodal features!

As always, Google is committed to the responsible use of AI. Android Studio won't send any of your source code to servers without your consent. You can read more on Gemini in Android Studio's commitment to privacy.

We appreciate any feedback on things you like or features you would like to see. If you find a bug, please report the issue and also check out known issues. Remember to also follow us on X, Medium, or YouTube for more Android development updates!

“Take notes for me” in Google Meet is available in seven additional languages

What’s changing 

Today, we are excited to start rolling out support “take notes for me” in the following seven additional languages: 
  • French 
  • German 
  • Italian 
  • Japanese 
  • Korean 
  • Portuguese 
  • Spanish 

When you enable "take notes for me” in Google Meet, you'll see the language in which the notes will be taken. You can click on the language to change it or you can change your language from Settings > Meeting records > Language spoken in the meeting. Note that multilingual meetings are not supported at this time.

Turning “Take notes for me” on


All meeting participants will see a blue pencil icon on their screen and a notification that notes are being taken. They can click on the pencil to see the meeting notes taken so far.


Getting started


Rollout pace


Note: This update will be rolling out at a much slower pace than usual as we carefully monitor performance and quality. We'll update this post when the rollout for each language is complete.


Availability

Available to Google Workspace
  • Business Standard and Plus
  • Enterprise Standard and Plus
  • Also available with the Gemini Education Premium add-on

Anyone who previously purchased these add-ons will also receive this feature:
  • Gemini Enterprise*
  • AI Meetings & Messaging*

*As of January 15, 2025, we’re no longer offering the Gemini Enterprise add-ons for sale. Please refer to this announcement for more details.

Resources


More AI-powered features in Google Meet and Google Chat are coming to Google Workspace Business and Enterprise editions

What’s changing 

Earlier this year, we announced that we’re including the best of Google AI in Workspace Business and Enterprise plans without the need to purchase a separate Gemini add-on. Beginning today, additional AI-powered features are available for Business and Enterprise editions: 


For Google Meet: 
  • Generated background images: With Gemini in Meet, you can generate unique and bespoke meeting backgrounds. Meeting backgrounds can help obscure your surroundings during a meeting or they can enhance the meeting itself. Users will also benefit from our latest model upgrade for background generation, as well as additional style options, like a professional office, library, or home office to help refine their custom backgrounds. 
  • Studio look: Gemini in Meet uses machine learning to detect and enhance (if necessary) the quality of your portrait by reducing noise and increasing sharpness, bringing you into focus so you can look your best in meetings. 
  • Studio lighting: Using machine learning, Gemini in Meet will simulate studio-quality lighting and adjust light position and brightness in your video feed, so you're perfectly lit for your meeting. Note: Studio lighting is only available on devices that meet certain browser and processor requirements. Visit the Help Center to learn more about device requirements for studio lighting
  • Studio sound: Gemini in Meet will automatically recreate and balance missing or distorted frequencies helping to make your voice come through crisp and clear. This can be especially useful when dialing in via phone or using bluetooth headsets for example. 

For Google Chat: 
  • Translate for me: Chat will automatically detect and translate over 120 languages to a user’s preferred language while keeping the original message available for review. Instead of navigating outside of Chat to translate a message, this reduces friction and improves collaboration with colleagues, partners and customers in other parts of the world. 

For Google Drawings: 
  • Background image removal: Gemini’s image background removal functionality (also available in Slides and Vids) is now available in Google Drawings.

Getting started

Rollout pace

Availability

Available for Google Workspace:
  • Business Standard and Plus
  • Enterprise Standard and Plus

These features are already available to Gemini for Workspace add-on customers. Note that as of January 15, 2025, Gemini Business, Gemini Enterprise, AI Meetings & Messaging, and AI Security add-ons are no longer available for purchase. Please refer to this announcement for more details.



More AI-powered features in Google Meet and Google Chat are coming to Google Workspace Business and Enterprise editions

What’s changing 

Earlier this year, we announced that we’re including the best of Google AI in Workspace Business and Enterprise plans without the need to purchase a separate Gemini add-on. Beginning today, additional AI-powered features are available for Business and Enterprise editions: 


For Google Meet: 
  • Generated background images: With Gemini in Meet, you can generate unique and bespoke meeting backgrounds. Meeting backgrounds can help obscure your surroundings during a meeting or they can enhance the meeting itself. Users will also benefit from our latest model upgrade for background generation, as well as additional style options, like a professional office, library, or home office to help refine their custom backgrounds. 
  • Studio look: Gemini in Meet uses machine learning to detect and enhance (if necessary) the quality of your portrait by reducing noise and increasing sharpness, bringing you into focus so you can look your best in meetings. 
  • Studio lighting: Using machine learning, Gemini in Meet will simulate studio-quality lighting and adjust light position and brightness in your video feed, so you're perfectly lit for your meeting. Note: Studio lighting is only available on devices that meet certain browser and processor requirements. Visit the Help Center to learn more about device requirements for studio lighting
  • Studio sound: Gemini in Meet will automatically recreate and balance missing or distorted frequencies helping to make your voice come through crisp and clear. This can be especially useful when dialing in via phone or using bluetooth headsets for example. 

For Google Chat: 
  • Translate for me: Chat will automatically detect and translate over 120 languages to a user’s preferred language while keeping the original message available for review. Instead of navigating outside of Chat to translate a message, this reduces friction and improves collaboration with colleagues, partners and customers in other parts of the world. 

For Google Drawings: 
  • Background image removal: Gemini’s image background removal functionality (also available in Slides and Vids) is now available in Google Drawings.

Getting started

Rollout pace

Availability

Available for Google Workspace:
  • Business Standard and Plus
  • Enterprise Standard and Plus

These features are already available to Gemini for Workspace add-on customers. Note that as of January 15, 2025, Gemini Business, Gemini Enterprise, AI Meetings & Messaging, and AI Security add-ons are no longer available for purchase. Please refer to this announcement for more details.



Use Gemini in the side panel of Workspace apps in four more languages

What’s changing

Beginning today, Gemini in the side panel of Google Docs, Google Sheets, Google Drive, and Gmail can be used in four additional languages: 
  • Greek 
  • Catalan 
  • Indonesian 
  • Malay 

With Gemini in the side panel of your Workspace apps, you can get help summarizing, brainstorming, and generating content by utilizing insights gathered from your emails, documents, and more—all without switching applications or tabs. Check out our original announcements for Gemini in the side panel of Docs, Sheets, and Drive, and Gmail for even more information. Image generation is supported in these languages as well. 


Additional details 

  • Users may see the “Alpha” badge as we bring more features into Gemini in the side panel of Google Workspace.
  • Image generation of people is not supported in these additional languages at this time.

Getting started 


Rollout pace


Availability 

Gemini in the side panel of Docs, Sheets, Drive is available to Google Workspace: 
  • Business Standard and Plus 
  • Enterprise Standard and Plus 
  • Customers with the Gemini Education or Gemini Education Premium add-on 
  • Customers with the Gemini Business or Gemini Enterprise add-on* 
Gemini in the side panel of Gmail is available to Google Workspace:
  • Business Starter, Standard and Plus 
  • Enterprise Starter, Standard and Plus 
  • Customers with the Gemini Education or Gemini Education Premium add-on 
  • Customers with the Gemini Business or Gemini Enterprise add-on* 
*As of January 15, 2025, we’re no longer offering the Gemini Business and Gemini Enterprise add-ons for sale. Please refer to this announcement for more details.

Resources 

Create files and folders using Gemini in the side panel of Google Drive

What’s changing 

Since rolling out Gemini in the side panel of Google Drive, users have been able to summarize one or multiple documents, get quick facts about a project, interact with the Gemini side panel while viewing PDFs, and we most recently added folder support

Today, we’re excited to expand Gemini in Drive capabilities by introducing the ability to create new Google Docs, Sheets, Slides and folders in your Drive. As part of this update, we’re introducing support for two new types of prompts using Gemini in the side panel.
  • “Create a new folder” (with or without specifying what to name it) 
  • “Create a new Google Doc, Sheet or Slide” (with or without specifying what to name it) 
create new Google Docs, Sheets, Slides and folders in your Drive using Gemini


Who’s impacted 

End users 


Why you’d use it 

This new capability will help you streamline your file and folder creation journeys without needing to leave the side panel to create new Docs, Sheets, Slides and folders. By typing in one of the supported prompts, Gemini will create the new, titled file or folder and provide you with a link. 


Getting started 

Rollout pace 


Availability 

Available for Google Workspace: 
  • Business Standard and Plus 
  • Enterprise Standard and Plus 
  • Customers with the Gemini Education or Gemini Education Premium add-on 
  • Google One AI Premium 
Anyone who previously purchased these add-ons will also receive this feature: 
  • Gemini Business* 
  • Gemini Enterprise* 
*As of January 15, 2025, we’re no longer offering the Gemini Business and Gemini Enterprise add-ons for sale. Please refer to this announcement for more details.

Resources 

Quickly add events to Google Calendar based on your emails with Gemini in Gmail

What’s changing

In addition to asking Gemini in Gmail to perform calendar related actions or answer questions about your calendar, you can now add an event to your calendar directly from an email. 

With this update, Gemini will automatically detect calendar related content in your email and an “Add to calendar” button will appear. Upon clicking this option, the side panel in Gmail will open to confirm the event has been added to your calendar. 

"Add to Calendar" button in Gmail



Getting started

  • Admins: To access Gemini in the side panel of Workspace apps, users need to have smart features and personalization turned on. Admins can turn on default personalization setting for their users in the Admin console. 
  • End users: 
    • This feature is only available in English and on web at this time. 
    •  The "Add to calendar" button will not appear for emails with already extracted events (like restaurants, flights, etc.). 
    • A calendar event created via the “Add to calendar” button will not include other guests. 
    • Visit the Help Center to learn more about collaborating with Gemini in Gmail. 

Rollout pace 


Availability 

Available for Google Workspace: 
  • Business Starter, Standard, and Plus 
  • Enterprise Starter, Standard, and Plus 
  • Customers with the Gemini Education or Gemini Education Premium add-on 
  • Google One AI Premium 
Anyone who previously purchased these add-ons will also receive this feature: 
  • Gemini Business* 
  • Gemini Enterprise* 

*As of January 15, 2025, we’re no longer offering the Gemini Business and Gemini Enterprise add-ons for sale. Please refer to this announcement for more details.

Resources 

Enhancements for custom and AI-generated backgrounds in Google Meet

What’s changing

We’re introducing two improvements for creating custom background images with Gemini in Google Meet:

  • First, we’ve upgraded the image generation model, which will significantly improve the visual appeal and quality of generated backgrounds, while also better representing user requests.
  • Next, we’ve added several new preset styles to help you get started creating your own backgrounds. Specifically, you’ll see options for the following:
    • Professional office
    • Bookshelf
    • Stylish living room
    • Cozy living room
    • Tropical beach
    • Fantasy castle
    • Sci-fi spaceship


Getting started


Rollout pace

Availability

Available for Google Workspace:
  • Business Standard and Plus
  • Enterprise Standard and Plus
  • Also available with the Gemini Education or Gemini Education Premium add-on

Anyone who previously purchased these add-ons will also receive this feature:
  • Gemini Business*
  • Gemini Enterprise*
  • AI Meetings and Messages*

*As of January 15, 2025, we’re no longer offering the Gemini Business and Gemini Enterprise add-ons for sale. Please refer to this announcement for more details.

Resources

Use Gemini in the side panel of Google Slides in seven new languages

What’s changing

Beginning today, you can use Gemini in the side panel of Google Slides, which includes the ability to generate images, in the following seven new languages: 
  • French 
  • German 
  • Italian 
  • Japanese 
  • Korean 
  • Portuguese 
  • Spanish 
With Gemini in the side panel of your Workspace apps, you can get help summarizing, brainstorming, and generating content by utilizing insights gathered from your emails, documents, and more—all without switching applications or tabs. Check out our original announcements for Gemini in the side panel of Slides, Docs, Sheets, and Drive, and Gmail for even more information. 


Additional details 

  • Users may see the “Alpha” badge as we bring more features into Gemini in the side panel of Google Workspace. 
  • Image generation of people is not supported in these additional languages at this time. 

Getting started 

  • Admins: The default setting for Gemini features in Workspace services is on. See how you can manage access to AI features in Workspace services. 
  • End users: 
    • Gemini in the side panel will work according to the language you set in your Google account (myaccount.google.com/language). If you’re accessing other Gemini for Google Workspace features that are supported in English only, you will need to set your Google Account language to English. 
    • You can access the side panel by clicking on “Ask Gemini” (spark button) in the top right corner of Slides on the web. Visit the Help Center to learn more about collaborating with Gemini in the side panel of Slides

Rollout pace 

Availability 

Available to Google Workspace: 
  • Business Standard and Plus 
  • Enterprise Standard and Plus 
  • Customers with the Gemini Education or Gemini Education Premium add-on 
  • Customers with the Gemini Business or Gemini Enterprise add-on* 
*As of January 15, 2025, we’re no longer offering the Gemini Business and Gemini Enterprise add-ons for sale. Please refer to this announcement for more details. 

Resources

New devices at MWC, gaming news, XR & Gemini in Android Studio: Tune in for our winter episode of #TheAndroidShow on March 13!

Posted by Anirudh Dewani, Director – Android Developer Relations

In just a few days, on Thursday, March 13 at 10AM PT, we’ll be dropping our winter episode of #TheAndroidShow, on YouTube and on developer.android.com!

Mobile World Congress - the annual event in Barcelona where Android device makers show off their latest devices, kicked off yesterday. In our winter episode we’ll take a look at these foldables, tablets and wearables and tell you what you need to get building.

Plus we’ve got some news to share, like a new update for Gemini in Android Studio and some new goodies for games developers ahead of the Game Developer Conference (GDC) in San Francisco later this month. And of course, with the launch of Android XR in December, we’ll also be taking a look at how to get building there. It’s a packed show, and you don’t want to miss it!

Some new Android foldables and tablets, at Mobile World Congress

Mobile World Congress is a big moment for Android, with partners from around the world showing off their latest devices. And if you’re already building adaptive apps, we wanted to share some of the cool new foldable and tablets that our partners released in Barcelona:

    • OPPO: OPPO launched their Find N5, their slim 8.93mm foldable with a 8.12” large screen - making it as compact or expansive as needed.
    • Xiaomi: Xiaomi debuted the Xiaomi Pad 7 series. Xiaomi Pad 7 provides a crystal-clear display and, with the productivity accessories, users get a desktop-like experience with the convenience of a tablet.
    • Lenovo: Lenovo showcased their Yoga Tab Plus, the latest powerful tablet from their lineup designed to empower creativity and productivity.

These new devices are a great reason to build adaptive apps that scale across screen sizes and device types. Plus, Android 16 removes the ability for apps to restrict orientation and resizability at the platform level, so you’ll want to prepare. To help you get started, the Compose Material 3 adaptive library enables you to quickly and easily create layouts across all screen sizes while reducing the overall development cost.

Tune in to #TheAndroidShow: March 13 at 10AM PT

These new devices are just one of the many things we’ll cover in our winter episode, you don’t want to miss it! If you watch live on YouTube, we’ll have folks standing by to answer your questions in the comments. See you on March 13 on YouTube or at developer.android.com/events/show!