ThursdAI - Recaps of the most high signal AI weekly spaces
ThursdAI - The top AI news from the past week
📅 ThursdAI - Jun 26 - Gemini CLI, Flux Kontext Dev, Search Live, Anthropic destroys books, Zucks superintelligent team & more AI news
0:00
-1:39:39

📅 ThursdAI - Jun 26 - Gemini CLI, Flux Kontext Dev, Search Live, Anthropic destroys books, Zucks superintelligent team & more AI news

From Weights & Biases, a special episode of ThursdAI (Wolfram took over while I'm on vacation 🏝️, and have tuned in as a regular listener), to discuss AI releases this week!

Hey folks, Alex here, writing from... a undisclosed tropical paradise location 🏝️ I'm on vacation, but the AI news doesn't stop of course, and neither does ThursdAI. So huge shoutout to Wolfram Ravenwlf for running the show this week, Nisten, LDJ and Yam who joined.

So... no long blogpost with analysis this week, but I'll def. recommend tuning in to the show that the folks ran, they had a few guests on, and even got some breaking news (new Flux Kontext that's open source)

Of course many of you are readers and are here for the links, so I'm including the raw TL;DR + speaker notes as prepared by the folks for the show!

P.S - our (rescheduled) hackathon is coming up in San Francisco, on July 12-13 called WeaveHacks, if you're interested at a chance to win a RoboDog, welcome to join us and give it a try. Register HERE

Image

Ok, that's it for this week, please enjoy the show and see you next week!

ThursdAI - June 26th, 2025 - TL;DR

  • Hosts and Guests

  • Open Source LLMs

    • Mistral Small 3.2 released with improved instruction following, reduced repetition & better function calling (X)

    • Unsloth AI releases dynamic GGUFs with fixed chat templates (X)

    • Kimi-VL-A3B-Thinking-2506 multimodal model updated for better video reasoning and higher resolution (Blog)

    • Chinese Academy of Science releases Stream-Omni, a new Any-to-Any model for unified multimodal input (HF, Paper)

    • Prime Intellect launches SYNTHETIC-2, an open reasoning dataset and synthetic data generation platform (X)

  • Big CO LLMs + APIs

    • Google

      • Gemini CLI, a new open-source AI agent, brings Gemini 2.5 Pro to your terminal (Blog, GitHub)

      • Google reduces free tier API limits for previous generation Gemini Flash models (X)

      • Search Live with voice conversation is now rolling out in AI Mode in the US (Blog, X)

      • Gemini API is now faster for video and PDF processing with improved caching (Docs)

    • Anthropic

      • Claude introduces an "artifacts" space for building, hosting, and sharing AI-powered apps (X)

      • Federal judge rules Anthropic's use of books for training Claude qualifies as fair use (X)

    • xAI

      • Elon Musk announces the successful launch of Tesla's Robotaxi (X)

    • Microsoft

      • Introduces Mu, a new language model powering the agent in Windows Settings (Blog)

    • Meta

      • Report: Meta pursued acquiring Ilya Sutskever's SSI, now hires co-founders Nat Friedman and Daniel Gross (X)

    • OpenAI

      • OpenAI removes mentions of its acquisition of Jony Ive's startup 'io' amid a trademark dispute (X)

      • OpenAI announces the release of DeepResearch in API + Webhook support (X)

  • This weeks Buzz

    • Alex is on vacation; WolframRvnwlf is attending AI Tinkerers Munich on July 25 (Event)

    • Join W&B Hackathon happening in 2 weeks in San Francisco - grand prize is a RoboDog! (Register for Free)

  • Vision & Video

    • MeiGen-MultiTalk code and checkpoints for multi-person talking head generation are released (GitHub, HF)

    • Google releases VideoPrism for generating adaptable video embeddings for various tasks (HF, Paper, GitHub)

  • Voice & Audio

    • ElevenLabs launches 11.ai, a voice-first personal assistant with MCP support (Sign Up, X)

    • Google Magenta releases Magenta RealTime, an open weights model for real-time music generation (Colab, Blog)

    • ElevenLabs launches a mobile app for iOS and Android for on-the-go voice generation (X)

  • AI Art & Diffusion & 3D

    • Google rolls out Imagen 4 and Imagen 4 Ultra in the Gemini API and Google AI Studio (Blog)

    • OmniGen 2 open weights model for enhanced image generation and editing is released (Project Page, Demo, Paper)

  • Tools

    • OpenMemory Chrome Extension provides shared memory across ChatGPT, Claude, Gemini and more (X)

    • LM Studio adds MCP support to connect local LLMs with your favorite servers (Blog)

    • Cursor is now available as a Slack integration (Dashboard)

    • All Hands AI releases the OpenHands CLI, a model-agnostic, open-source coding agent (Blog, Docs)

    • Warp 2.0 launches as an Agentic Development Environment with multi-threading (X)

  • Studies and Others

    • The /r/LocalLLaMA subreddit is back online after a brief moderation issue (Reddit, News)

    • Andrej Karpathy's talk "Software 3.0" discusses the future of programming in the age of AI (YouTube, Summary)

Thank you, see you next week!

Discussion about this episode

User's avatar