Microsoft teased that its ‘Copilot Vision’ feature is coming ‘very soon,’ enabling the AI assistant to see and understand a user’s browser content and behavior.
Microsoft launched adapted AI models, offering specialized small language models to address sector-specific challenges in manufacturing, automotive, and agriculture.
Microsoft began integrating Copilot AI features into standard Microsoft 365 subscriptions in certain Asia-Pacific markets, signaling a potential shift away from its separate Copilot Pro subscription model.
Google released ‘Grounding with Google Search’ for its Gemini API and AI Studio, letting developers integrate real-time search results into model responses to reduce hallucinations and improve accuracy.
Google released a new standalone Gemini iPhone app featuring Gemini Live voice conversations, image generation capabilities, and broader integration with Google services.
Anthropic added new developer tools in its Console to automatically improve prompts, with the ability to manage examples and evaluate outputs to boost response accuracy and consistency.
NVIDIA has introduced an AI Blueprint that enables developers to create visual AI agents capable of analyzing and summarizing large volumes of video and image content.
NVIDIA and SoftBank are testing the world’s first telecom network that combines AI with 5G.
DeepL introduced Voice, a real-time translation service supporting 13 spoken languages and 33 written languages, initially focusing on text-based output for Teams meetings and in-person conversations.
Hume launched its new app featuring AI assistants that blend the company’s EVI 2 speech-language model with Claude 3.5 Sonnet and Haiku for conversational interactions, emotional reflection, deep questions, and life advice.
Rabbit AI is focusing on creating autonomous AI agents capable of performing tasks with minimal human intervention.
Wonder Dynamics announced Wonder Animation, which enables artists to shoot a scene with any camera, in any location, and turn the sequence into an animated scene.
Chinese AI video platform KLING is launching a ‘Custom Models’ feature, allowing users to train personalized video characters using 10-30 video clips for consistent appearances across scenes and camera angles.
Chinese tech giant Baidu will reportedly unveil AI-powered smart glasses equipped with voice and camera capabilities at its upcoming Baidu World event, positioning the product as a competitor to Meta’s Ray-Ban smart glasses at a lower price point.
Black Forest Labs has enhanced its FLUX1.1 pro model with two new modes: Ultra mode for 4x higher-resolution images and Raw mode for a more natural snapshot-style look.
Llama 3.2 Vision is now available to run in Ollama, in both 11B and 90B sizes.
AMD is getting in on the LLM game with a new open-source, 1B parameter model called OLMo, which outperforms similar-sized compact LLMs like MobiLlama.
Suno showcased new demos of its soon-to-be-released v4 model, with enhanced audio samples demonstrating improved naturalness and consistency.
xAI launched a free tier of its Grok chatbot in select regions, offering limited access to Grok 2, Grok 2 mini, and image analysis capabilities.
Mistral released an open-source platform that uses AI to spot and flag harmful content across nine categories and 11 languages.
InVideo launched a new AI video creation tool that can generate multi-minute videos with music and text in various styles from a single prompt.
Stripe introduced a new agent toolkit, enabling developers to integrate payments, financial services, and usage-based billing into LLM-powered agent workflows.
Apple released its Final Cut Pro 11 editing software, featuring new AI-powered features like Magnetic Mask for green screen-free object isolation and LLM-driven caption generation.