Author Archives:

Prepare for the NEET UG with practice tests in Gemini

We recently launched full-length, no-cost practice tests in Gemini, starting with the SAT and JEE Main. Today, we’re expanding practice tests to support the NEET UG.

We have grounded practice tests in rigorously vetted content from leading education companies like Physics Wallah and Careers360, to build a best in class experience for learners coming to Gemini. This helps ensure that you’re not just practicing — you’re preparing with material that more closely resembles what you’ll see on test day.

To try it out, just tell Gemini “I want to take a NEET mock exam.”


Note: This feature is currently available in English only.

Getting started

  • Admins: The Gemini app and related in-app tools are controlled by the Generative AI settings in the Workspace Admin console. Practice tests in Gemini are subject to these existing controls. Visit the Help Center for more information on turning the Gemini app on or off.
  • End users: End users of all ages who have access to the Gemini app will receive access to practice tests automatically. To get started, tell Gemini which practice test you want to take.

Rollout pace

Availability

  • Available to all Google Workspace customers, Workspace Individual subscribers, and users with personal Google accounts who are signed in to the Gemini app

Resources

Prepare for the NEET UG with practice tests in Gemini

We recently launched full-length, no-cost practice tests in Gemini, starting with the SAT and JEE Main. Today, we’re expanding practice tests to support the NEET UG.

We have grounded practice tests in rigorously vetted content from leading education companies like Physics Wallah and Careers360, to build a best in class experience for learners coming to Gemini. This helps ensure that you’re not just practicing — you’re preparing with material that more closely resembles what you’ll see on test day.

To try it out, just tell Gemini “I want to take a NEET mock exam.”


Note: This feature is currently available in English only.

Getting started

  • Admins: The Gemini app and related in-app tools are controlled by the Generative AI settings in the Workspace Admin console. Practice tests in Gemini are subject to these existing controls. Visit the Help Center for more information on turning the Gemini app on or off.
  • End users: End users of all ages who have access to the Gemini app will receive access to practice tests automatically. To get started, tell Gemini which practice test you want to take.

Rollout pace

Availability

  • Available to all Google Workspace customers, Workspace Individual subscribers, and users with personal Google accounts who are signed in to the Gemini app

Resources

Expanded NotebookLM capabilities for Education Plus and Teaching & Learning add-on customers

We’re increasing limits across NotebookLM features for customers with Google Workspace for Education Plus or a Teaching and Learning add-on license at no additional cost. Users will now see a Plus badge next to their profile picture, indicating access to higher usage thresholds.
Users with Google Workspace for Education Plus or a Teaching and Learning add-on license will benefit from several upgrades, including:

  • Increased source context: Support for more sources in each notebook.
  • More engagement: Send more chat queries per day.
  • Expanded study tools: Create more flashcard sets and quizzes.
  • More multimedia generation: Create more Video Overviews, Audio Overviews, infographics and slide decks.
Visit the Help Center to see a full list of NotebookLM features and limits.

Note: Certain NotebookLM features are only available to 18+ users. Visit the Help Center to learn more

Getting started

  • Admins: NotebookLM is ON by default and can be disabled at the domain, OU, or group level. Visit the Help Center to learn more.
  • End users: There is no end user setting for this feature. Visit the Help Center to learn more.

Rollout pace

Availability

  • Education: Education Plus; Teaching and Learning add-on

Resources

Expanded NotebookLM capabilities for Education Plus and Teaching & Learning add-on customers

We’re increasing limits across NotebookLM features for customers with Google Workspace for Education Plus or a Teaching and Learning add-on license at no additional cost. Users will now see a Plus badge next to their profile picture, indicating access to higher usage thresholds.
Users with Google Workspace for Education Plus or a Teaching and Learning add-on license will benefit from several upgrades, including:

  • Increased source context: Support for more sources in each notebook.
  • More engagement: Send more chat queries per day.
  • Expanded study tools: Create more flashcard sets and quizzes.
  • More multimedia generation: Create more Video Overviews, Audio Overviews, infographics and slide decks.
Visit the Help Center to see a full list of NotebookLM features and limits.

Note: Certain NotebookLM features are only available to 18+ users. Visit the Help Center to learn more

Getting started

  • Admins: NotebookLM is ON by default and can be disabled at the domain, OU, or group level. Visit the Help Center to learn more.
  • End users: There is no end user setting for this feature. Visit the Help Center to learn more.

Rollout pace

Availability

  • Education: Education Plus; Teaching and Learning add-on

Resources

Test Multi-Device Interactions with the Android Emulator

Posted by Steven Jenkins, Product Manager, Android Studio










Testing multi-device interactions is now easier than ever with the Android Emulator. Whether you are building a multiplayer game, extending your mobile application across form factors, or launching virtual devices that require a device connection, the Android Emulator now natively supports these developer experiences.

Previously, interconnecting multiple Android Virtual Devices (AVDs) caused significant friction. It required manually managing complex port forwarding rules just to get two emulators to connect.

Now you can take advantage of a new networking stack for the Android Emulator which brings zero-configuration peer-to-peer connectivity across all your AVDs.

Interconnecting emulator instances

The new networking stack for the Android Emulator transforms how emulators communicate. Previously, each virtual device operated on its own local area network (LAN), effectively isolating it from other AVDs. The new Wi-Fi network stack changes this by creating a shared virtual network backplane that bridges all running instances on the same host machine.

Key Benefits:

  • Zero-configuration: No more manual port forwarding or scripting adb commands. AVDs on the same host appear on the same virtual network.
  • Peer-to-peer connectivity: Critical protocols like Wi-Fi Direct and Network Service Discovery (NSD) work out of the box between emulators.
  • Improved stability: Resolves long-standing stability issues, such as data loss and connection drops found in the legacy stack.
  • Cross-platform consistency: Works the same across Windows, macOS and Linux.

Use Cases

The enhanced emulator networking supports a wide range of multi-device development scenarios:

  • Multi-device apps: Test file sharing, local multiplayer gaming, or control flows between a phone and another Android device.
  • Continuous Integration: Create robust, automated multi-device test pipelines without flaky network scripts.
  • Android XR & AI glasses: Easily test companion app pairing and data streaming between a phone and glasses within Android Studio.
  • Automotive & Wear OS: Validate connectivity flows between a mobile device and a vehicle head unit or smartwatch.



The new emulator networking stack allows multiple AVDs to share a virtual network, 
enabling direct peer-to-peer communication with zero configuration.

Get Started

The new networking capability is enabled by default in the latest Android Emulator release (36.5), which is available via the Android Studio SDK Manager. Just update your emulator and launch multiple devices!

If you need to disable this feature or want to learn more, please refer to our documentation.

As always, we appreciate any feedback. If you find a bug or issue, please file an issue. Also you can be part of our vibrant Android developer community on LinkedIn, Medium, Youtube, or X.

Google Workspace Updates Weekly Recap – April 10, 2026

Book Google Workspace resources from third-party calendars

Organizations that use both Google Workspace and other calendaring systems, like Microsoft Outlook, can now more easily coordinate shared resources, such as rooms, projectors, or company cars. | Learn more about how to book Google Workspace resources from third-party calendars.

Greater control and error visibility for Google Sheets formulas

We are introducing updates to the logic and parameter support for a targeted set of Google Sheets functions. These changes are designed to improve error visibility, offer more control over data, and ensure seamless compatibility when importing files. | Learn more about greater control and error visibility for Google Sheets formulas.

Migration update on restricted access items

With this update, all items with legacy restricted access will be automatically migrated to use the limited access setting instead. There will be no change to who can see or access the files. | Learn more about Migration update on restricted access items.

Speech translation in Google Meet is now rolling out to mobile devices

Following our recent general availability launch for web, we are excited to announce that speech translation is now rolling out to the Meet Android and iOS apps. The feature allows audio to be translated to other languages in near-real-time, helping global teams communicate more naturally and removing language barriers. | Learn more about speech translation in Google Meet is now out to mobile devices.

Edit your AI-generated scripts when you convert Slides to Vids

Now when you import Slides into Google Vids with Gemini enabled, you can see and edit your AI-generated scripts for each slide before completing the import, generating voiceovers, and applying animations. | Learn more about how to edit your AI-generated scripts when you convert Slides to Vids.

Gmail end-to-end encryption now available on mobile devices

We’re expanding Gmail end-to-end encryption (E2EE) to Android and iOS devices for Gmail client-side encryption (CSE) users. With Gmail E2EE, your users can confidentially engage with your organization's most sensitive data from anywhere on their mobile devices while ensuring data remains compliant and with your organizations sovereignty and compliance requirements. | Learn more about Gmail end-to-end encryption now available on mobile devices.

Expanding access to longer musical tracks in the Gemini app

Last month, we introduced Lyria 3 Pro in the Gemini app for select Business, Enterprise, and Education editions. Now, more users can create longer tracks with Lyria 3 Pro in the Gemini app. | Learn more about expanded access to longer musical tracks in the Gemini app.

The announcements above were published on the Workspace Updates blog over the last week. Please refer to the original blog posts for complete details.

Leveraging CPU memory for faster, cost-efficient TPU LLM training

Intel Xeon 6 Processor

Host offloading with JAX on Intel® Xeon® processors

As Large Language Models (LLMs) continue to scale into the hundreds of billions of parameters, device memory capacity has become a big limiting factor in training, as intermediate activations from every layer in the forward pass are needed in the backward pass. To reduce device memory pressure, these activations can be rematerialized during the backward pass, trading memory for recomputation. While rematerialization enables larger models to fit within limited device memory, it significantly increases training time and cost.

Intel® Xeon® processors (5th and 6th Gen) with Advanced Matrix Extensions (AMX) enable practical host offloading of selected memory- and compute-intensive components in JAX training workflows. This approach can help teams train larger models, relieve accelerator memory pressure, improve end-to-end throughput, and reduce total cost of ownership—particularly on TPU-based Google Cloud instances.

By publishing these results and implementation details, Google and Intel aim to promote transparency and share practical guidance with the community. This post describes how to enable activation offloading for JAX on TPU platforms and outlines considerations for building scalable, cost-aware hybrid CPU–accelerator training workflows.

Figure 1. Google Cloud TPU Pod commonly used in LLM training.

Host offloading

Traditional LLM training is usually done on device accelerators alone. However, modern host machines have much larger memory size than accelerators (512GB or more) and can offer extra compute power, e.g., TFLOPS in case of Intel® Xeon® Scalable Processor with AMX capability. Leveraging host resources can be a great alternative to rematerialization. Host offloading selectively moves computation or data between host and device to optimize performance and memory usage.

Host memory offloading keeps frequently-accessed tensors on the device and spills the rest to CPU memory as an extra level of cache. Activation offloading transfers activations computed on-device in the forward pass to the host, stores them in the host memory, and brings them back to the device in the backward pass for gradient computation. This unlocks the ability to train larger models, use bigger batch sizes, and improve throughput.

Figure 2: Memory offloading during forward and backward pass

In this blog post, we provide a practical guide to offload activations through JAX to efficiently train larger models on TPUs with an Intel® Xeon® Scalable Processor.

Enabling memory offloading in JAX

JAX offers multiple strategies for offloading activations, model parameters, and optimizer states to the host. Users can use checkpoint_names() to create a checkpoint for a tensor. The snippet below shows how to create a checkpoint  x:

from jax.ad_checkpoint import checkpoint_name 
 
def layer_name(x, w): 
  w1, w2 = w 
  x = checkpoint_name(x, "x") 
  y = x @ w1 
  return y @ w2, None  

Users can provide checkpoint_policies() to select the appropriate memory optimization strategy for intermediate values. There are three strategies:

  1. Recomputing during backward pass (default behavior)
  2. Storing on device
  3. Offloading to host memory after forward pass and loading back during backward pass

The code below moves x from device to the pinned host memory after the forward pass.
from jax import checkpoint_policies as cp

policy = cp.save_and_offload_only_these_names( 
  names_which_can_be_saved=[],         # No values stored on device 
  names_which_can_be_offloaded=["x"],  # Offload activations labeled "x" 
  offload_src="device",                # Move from device memory 
  offload_dst="pinned_host"            # To pinned host memory 
) 

Measuring Host Offloading Benefits on TPU v5p

We examined TPU host-offloading on JAX on both fine-tuning and training workloads. All our experiments were run on Google Cloud Platform, using a single v5p-8 TPU instance with single host 4th Gen Intel® Xeon® Scalable Processor.

Fine-tuning PaliGemma2: Using the base PaliGemma2 28B model for vision-language tasks, we fine-tuned the attention layers of the language model (Gemma2 27B) while keeping all other parameters frozen. During fine-tuning, we set the LLM sequence length to 256 and the batch size to 256.

The default checkpoint policy is nothing_saveable, which does not keep any activations on-device during the forward pass. The activations are rematerialized during the backward pass for gradient computation. While this approach reduces memory pressure on the TPU, it increases compute time. To apply host offloading, we offload Q, K, and V projection weights using save_and_offload_only_these_names. These activations are transferred to host memory (D2H) during the forward pass and fetched back during the backward pass (H2D), so the device neither stores nor recomputes them. Figure 2 shows 10% reduction in training time from host offloading. This translates directly into a similar reduction in TPU core-hours, yielding meaningful cost savings. The complete fine-tuning recipe is available at [JAX host offloading].

Figure 3: (Top) Training time comparison between full rematerialization and host offloading.
(Bottom) Memory analysis with and without host offloading.

Training Llama2-13B using MaxText: MaxText offers several rematerialization strategies that can be specified in the training configuration file. We used the policy remat_policy: 'qkv_proj_offloaded' to offload Q, K, and V projection weights. Figure 3 shows ~5% reduction in per-step training time compared to fully rematerializing all activations ( remat_policy: 'full').

Figure 4: MaxText Llama2-13B training statistics with and without host offloading.
The step time was 5% faster with host offloading.

When to offload activations

Activation offloading is beneficial when the time to transfer activations across host and device is lower than the time to recompute them. The timing depends on multiple factors such as PCIe bandwidth, model size, batch size, sequence length, activation tensor sizes, compute capabilities of the device, etc. An additional factor is how much the data movement can be overlapped with computation to keep the device busy. Figure 4 demonstrates an efficient overlap of the device-to-host transfer with compute during the backward pass in PaliGemma2 28B training.

Figure 5: A JAX trace of PaliGemma2 training viewed on Perfetto.
Memory offloading overlaps with compute effectively during backward pass host to device.

Smaller model variants such as PaliGemma2 3B and 9B did not see benefits from host offloading because it is faster to rematerialize all tensors than to transfer them to and from the host. Therefore, identifying the appropriate workload and offloading policy is crucial to realizing performance gain from host offloading

Call to Action

If you train on TPUs and are limited by device memory, consider evaluating activation offloading. Start by labeling candidate activations (for example, Q/K/V projections) and compare step time, memory headroom, and overall cost across representative workloads.

In our experiments, we observed up to ~10% improvement in end-to-end training time for larger workloads, which can reduce total cost of ownership (TCO) by shortening time-to-train or enabling the same workload on smaller instances.

Acknowledgments

Emilio Cota, and Karlo Basioli from Google and Eugene Zhulenev (formerly at Google).