Image by Editor
# Introducing Opal
Google Opal is a no-code, experimental tool from Google Labs. It is designed to enable users to build and share AI-powered micro-applications using natural language. The tool converts text prompts into visual, editable workflows. This enables users to create AI applications quickly and easily.
…
How can we get large model level multimodal reasoning for documents, charts and videos while running only a 3B class model in production? Baidu has added a new model to the ERNIE-4.5 open source family. ERNIE-4.5-VL-28B-A3B-Thinking is a vision language model that focuses on document, chart and video understanding with a small active parameter budget.…
Image by Author
# Introduction
Email overload is real. You open your inbox to find dozens of messages, and you're immediately playing mental triage. Which ones need attention now? Which can wait?
It's exhausting, and it's easy to miss something important while you're drowning in notifications.
Here's a better way: build an automated…
Even strong ‘long-context’ AI models fail badly when they must track objects and counts over long, messy video streams, so the next competitive edge will come from models that predict what comes next and selectively remember only surprising, important events, not from just buying more compute and bigger context windows. A team of researchers from…
Sponsored Content
From November 6 to November 21, 2025 (8:00 a.m. UTC), 365 Data Science is opening its entire learning platform for free. This limited-time initiative gives aspiring AI professionals and data enthusiasts the chance to develop their skills and gain hands-on experience at no cost.
Tradition and Mission
…
Computer-use agents have been limited to primitives. They click, they type, they scroll. Long action chains amplify grounding errors and waste steps. Apple Researchers introduce UltraCUA, a foundation model that builds an hybrid action space that lets an agent interleave low level GUI actions with high level programmatic tool calls. The model chooses the cheaper…
How do you build a single model that can learn physical skills from chaotic real world robot data without relying on simulation? Generalist AI has unveiled GEN-θ, a family of embodied foundation models trained directly on high fidelity raw physical interaction data instead of internet video or simulation. The system is built to establish scaling…
Sponsored Content
Is your team using generative AI to enhance code quality, expedite delivery, and reduce time spent per sprint? Or are you still in the experimentation and exploration phase? Wherever you are on this journey, you can’t deny the fact that Gen AI is increasingly changing our reality today. It’s…
Can we render long texts as images and use a VLM to achieve 3–4× token compression, preserving accuracy while scaling a 128K context toward 1M-token workloads? A team of researchers from Zhipu AI release Glyph, an AI framework for scaling the context length through visual-text compression. It renders long textual sequences into images and processes…
Mathematics is the foundational language of the universe, providing the tools to describe everything from the laws of physics to the intricacies of biology and the logic of computer science. For centuries, its frontiers have been expanded by human ingenuity alone. At Google DeepMind, we believe AI can serve as a powerful tool to collaborate…