OpenAI Reveals New “Operator” AI Agent
Plus: Google Launches Gemini AI App for iPhone
Welcome to Get The Gist, where every weekday we share an easy-to-read summary of the latest and greatest developments in AI—news, innovations, and trends—all delivered in under 5 minutes! ⏱
In today’s edition:
OpenAI Reveals New “Operator” AI Agent
Google Launches Gemini AI App for iPhone
YouTube Tests AI Music Remixing Feature for Shorts
Perplexity Introduces AI-Generated Ads
And more AI news…
Top Developments
OpenAI
Image by: Bloomberg
The Gist: OpenAI is developing a new AI agent called "Operator," designed to automate complex tasks like booking travel and writing code with minimal user supervision. A preview release is slated for early 2025.
Key Details:
"Operator" will be a general-purpose AI agent capable of using a web browser to execute multi-step tasks, and it will be available to developers via an API.
CEO Sam Altman emphasized that agents represent the next big breakthrough in AI, hinting at a shift beyond traditional chatbot models.
Google is also in the race with “Project Jarvis,” an AI agent under development that automates tasks in Google Chrome by interacting with web elements.
Unlike standard chatbots, AI agents store past interactions and plan future actions, making them more autonomous and efficient at completing repetitive tasks.
Google Gemini
Image by: Android Police
The Gist: Google’s new Gemini AI app for iPhone offers enhanced, standalone AI functionality, including voice and text queries, Dynamic Island integration, and the premium Gemini Advanced plan.
Key Details:
Gemini Live is now accessible on iPhone, allowing users to manage AI interactions directly from the Dynamic Island and Lock Screen.
The app is free with in-app subscriptions; Gemini Advanced, part of the Google One premium plan, costs $18.99 monthly and includes priority access to advanced features.
This standalone release brings iOS users closer to the Android experience, with expanded regional availability following its soft launch in the Philippines.
Gemini Advanced subscribers get access to Google’s 1.5 Pro model, a one-million-token context window, and Gemini tools across Google services like Gmail and Docs.
YouTube
Image by: YouTube
The Gist: YouTube is trialing an AI-powered tool for creators to remix licensed music tracks on Shorts by adjusting genre, mood, and style through text prompts.
Key Details:
The feature, “Restyle a track,” allows select creators to generate 30-second custom remixes of licensed songs for their Shorts by describing the desired style.
The trial includes AI-generated voices of participating artists like Charlie Puth, Charli XCX, Demi Lovato, and John Legend, though specific available tracks are undisclosed.
Remixed audio will credit the original song and mark AI modifications, with compensation to artists and rights holders through YouTube’s partnership with Universal Music Group.
Part of YouTube's Dream Track toolset, this remix feature is built on DeepMind’s Lyria model, as are other tools such as one that creates tracks from a user's humming.
Quick Gist
Adobe has integrated Firefly AI into Adobe Stock, allowing users to make generative edits and customize existing images, while ensuring compensation for original contributors (Read More).
Perplexity introduced AI-generated ads through sponsored follow-up questions within its search results, maintaining that these ads will not affect the impartiality of its search responses (Read More).
Google has launched an AI Scam Detection feature in beta for its Pixel Phone app in the US, which analyzes audio to warn users of potential scam calls (Read More).
Alibaba’s Qwen2.5 Coder is a new open-source AI model for coding that rivals or surpasses models like OpenAI’s GPT-4o and Anthropic’s Claude 3.5, showcasing Alibaba's competitive edge in open-source development (Read More).
At Adobe MAX 2024, Adobe introduced updates to its Firefly AI with new video capabilities and enhancements for Illustrator, Photoshop, and Lightroom, reiterating its commitment to ethical AI practices (Read More).
Apple launched Final Cut Pro 11 for Mac, introducing AI-powered tools for masking and captioning, spatial video editing, and improved editing options in the updated iPad version (Read More).
THAT’S IT FOR TODAY!
See you tomorrow! 👋
If you have any questions, feedback, or requests, drop us an email at [email protected]. We love hearing from our readers! 😊
P.S. If this email was forwarded to you, you can sign up for free by clicking here!