This entry was tagged: Vision fine-tuning
...and we hoped this would become a reality a few years later.
Previously, video overviews could only generate narrated slideshows, but the upgraded video overview feature uses a combination of Google's AI models, "including Gemini 3, Nano Banana Pro and Veo 3," to generate animated visuals based on the content of users' notes. Google says Gemini "determines the best narrative, visual style and format, and even refines its own work to ensure consistency" when generating the videos.