Google Launches Gemini 3.1 Flash-Lite - Built for Intelligence at Scale
A deep dive into today's release of Gemini 3.1 Flash-Lite, Google's new model optimized for low-latency, high-volume, and cost-sensitive LLM traffic.
Over the winter break, I finally achieved a goal I’ve been chasing for 15 years. I produced a full-length music album that sounds exactly the way I imagined it.
It’s titled American Intelligence, and it was created with help from Suno AI.
How to authenticate the Google Gen AI SDK.
Large language models (LLMs) like Gemini are incredibly powerful, but they have a fundamental limitation: their knowledge is frozen at the time they were trained. On their own, they have no access to live information from the internet. This means if you ask about today's news, stock prices, or the weather, they can't give you a current answer.
This is where grounding comes in. Grounding connects the model to external, authoritative sources of information, like Google Search. By giving Gemini the ability to search the web, we can unlock its potential to answer questions about the here and now, making its responses more timely, more accurate, and easier to verify against cited sources. 🌐
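To make the idea concrete, here is a minimal, library-free sketch of the general grounding pattern: fetch fresh snippets first, then hand them to the model along with the question. It is not Gemini's built-in Google Search tool. The `Source` type, `build_grounded_prompt`, `answer_with_grounding`, and the `search` and `generate` callables are all hypothetical names introduced for illustration; in practice you would plug in a real search client and a real model call.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Source:
    """One search result (hypothetical shape, for illustration only)."""
    title: str
    url: str
    snippet: str


def build_grounded_prompt(question: str, sources: List[Source]) -> str:
    """Prepend numbered snippets so the model can cite them as [n]."""
    lines = ["Answer using only the sources below and cite them as [n].", ""]
    for i, src in enumerate(sources, start=1):
        lines.append(f"[{i}] {src.title} ({src.url}): {src.snippet}")
    lines += ["", f"Question: {question}"]
    return "\n".join(lines)


def answer_with_grounding(
    question: str,
    search: Callable[[str], List[Source]],   # assumed: returns ranked results
    generate: Callable[[str], str],          # assumed: wraps an LLM call
    top_k: int = 3,
) -> Tuple[str, List[Source]]:
    """Retrieve up to top_k sources, then ask the model with them attached."""
    sources = search(question)[:top_k]
    prompt = build_grounded_prompt(question, sources)
    # Return the sources too, so the caller can show them for verification.
    return generate(prompt), sources
```

Returning the sources alongside the answer is what makes the response checkable: the reader can follow each [n] citation back to its URL.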
How to turn Google Docs into speech.
How to build an AI interactive voice agent (IVA) with Gemini Multimodal Live & Twilio.