Unveiling the forefront of innovation, decoding the language of algorithms, and illuminating the transformative impact of artificial intelligence on our rapidly evolving world.
Stay informed, stay ahead, as we navigate the cutting edge of AI developments, one byte at a time. Your gateway to the future starts here.
- The AI talent wars are just getting startedby Alex Heath on December 20, 2024 at 8:26 pm
Naveen Rao, VP of AI at Databricks. | Naveen Rao / The Verge For my last issue of the year, I’m focusing on the AI talent war, which is a theme I’ve been covering since this newsletter launched almost two years ago. And keep reading for the latest from inside Google and Meta this week. […]
- OpenAI teases new reasoning model—but don’t expect to try it soonby Kylie Robison on December 20, 2024 at 6:55 pm
Image: Alex Parkin / The Verge For the last day of ship-mas, OpenAI previewed a new set of frontier “reasoning” models dubbed o3 and o3-mini. The Verge first reported that a new reasoning model would be coming during this event. The company isn’t releasing these models today (and […]
- Google Search will reportedly have a dedicated ‘AI Mode’ soonby Emma Roth on December 20, 2024 at 2:22 pm
Illustration: The Verge Google is planning to add a new “AI Mode” to its search engine, according to a report from The Information. The company will reportedly display an option to switch to AI Mode from the top of the results page, allowing you to access an interface similar to its […]
- Google reveals AI ‘reasoning’ model that ‘explicitly shows its thoughts’by Emma Roth on December 19, 2024 at 6:46 pm
Illustration: The Verge Google has introduced a new AI “reasoning” model capable of answering complex questions while also providing a rundown of its “thoughts,” as reported earlier by TechCrunch. The model, called Gemini 2.0 Flash Thinking, is still experimental and will likely […]
- Instagram teases AI editing tools that will completely reimagine your videosby Jess Weatherbed on December 19, 2024 at 3:39 pm
The puppet influencers are coming...in 2025. | Image: The Verge / Adam Mosseri Instagram is planning to introduce a generative AI editing feature next year that will allow users to “change nearly any aspect of your videos.” The tech is powered by Meta’s Movie Gen AI model according to […]
- Microsoft is testing live translation on Intel and AMD Copilot Plus PCsby Emma Roth on December 19, 2024 at 2:13 pm
Image: Microsoft Microsoft is previewing live translation on Intel and AMD-based Copilot Plus PCs. The feature is rolling out now to Windows 11 Insiders in the Dev Channel, allowing users to translate audio from over 44 languages into English subtitles. Live translation, which initially […]
- You can now call 1-800-CHATGPTby Kylie Robison on December 18, 2024 at 6:31 pm
OpenAI For the 10th day of “ship-mas,” OpenAI rolled out a way to call ChatGPT for up to 15 minutes for free over the phone using 1-800-CHATGPT. The feature was a project spun up just a few weeks ago, OpenAI’s chief product officer Kevin Weil said on the livestream. Users can now call […]
- YouTube says that soon, its tech will be able to find AI copies of celebs and creatorsby Emma Roth on December 17, 2024 at 8:44 pm
Illustration: Alex Castro / The Verge YouTube is partnering with the Creative Artists Agency (CAA) to help creators identify content using their AI-generated likenesses on the platform and submit removal requests. The company will test the controls with celebrities and athletes early next […]
- How to add extensions to Geminiby Barbara Krasnoff on December 17, 2024 at 7:00 pm
Image: The Verge I was driving home the other day and wanted to call my partner and let him know that I was stuck in traffic. (Not an unusual event on Brooklyn’s Belt Parkway.) I’ve got a relatively old car (it’s a 2007 model, so we’re talking no real smarts), and so I depend on my […]
- Nvidia’s $249 dev kit promises cheap, small AI powerby Wes Davis on December 17, 2024 at 6:27 pm
Nvidia announced the latest in its Jetson Orin Nano AI computer line, the Jetson Orin Nano Super Developer Kit. Sort of like a Raspberry Pi but for powerful AI processing, the tiny $249 computer packs more of an AI processing punch than the kit did before — for half the price. It’s available to […]
- From Token to Conceptual: Meta introduces Large Concept Models in Multilingual AIby Synced on December 18, 2024 at 2:15 am
A research team at Meta introduces the Large Concept Model (LCM), a novel architecture that processes input at a higher semantic level. This shift allows the LCM to achieve remarkable zero-shot generalization across languages, outperforming existing LLMs of comparable size. The post From Token to […]
- NVIDIA’s Hybrid: Combining Attention and State Space Models for Breakthrough Performance of Small Language Modelsby Synced on December 14, 2024 at 3:00 pm
An NVIDIA research team proposes Hymba, a family of small language models that blend transformer attention with state space models, which outperforms the Llama-3.2-3B model with a 1.32% higher average accuracy, while reducing cache size by 11.67× and increasing throughput by 3.49×. The post […]
- From Response to Query: The Power of Reverse Thinking in Language Modelsby Synced on December 12, 2024 at 8:58 pm
In a new paper Time-Reversal Provides Unsupervised Feedback to LLMs, a research team from Google DeepMind and Indian Institute of Science proposes Time Reversed Language Models (TRLMs), a framework that allows LLMs to reason in reverse—scoring and generating content in a manner opposite to the […]
- Yann LeCun Team’s New Research: Revolutionizing Visual Navigation with Navigation World Modelsby Synced on December 9, 2024 at 9:01 pm
In a new paper Navigation World Models, a research team from Meta, New York University and Berkeley AI Research proposes a Navigation World Model (NWM), a controllable video generation model that enables agents to simulate potential navigation plans and assess their feasibility before taking […]
- The Future of Vision AI: How Apple’s AIMV2 Leverages Images and Text to Lead the Packby Synced on December 8, 2024 at 12:43 am
An Apple research team introduces AIMV2, a family of vision encoders that is designed to predict both image patches and text tokens within a unified sequence. This combined objective enables the model to excel in a range of tasks, such as image recognition, visual grounding, and multimodal […]
- Redefining Music AI: The Power of Sony’s SoniDo as a Versatile Foundation Modelby Synced on December 5, 2024 at 8:17 pm
In a new paper Music Foundation Model as Generic Booster for Music Downstream Tasks, a Sony research team presents SoniDo, a groundbreaking music foundation model that offers robust framework for improving the effectiveness and accessibility of music processing. The post Redefining Music AI: The […]
- DeepMind’s Socratic Learning with Language Games: The Path to Self-Improving Superintelligenceby Synced on November 29, 2024 at 7:15 pm
Researchers from Google DeepMind introduce the concept of "Socratic learning." This refers to a form of recursive self-improvement in artificial intelligence that significantly enhances performance beyond the initial data or knowledge available to the system, as well as a practical framework to […]
- Revolutionizing AI on a Budget: Apple’s Roadmap for Small Language Models Training Successby Synced on November 29, 2024 at 12:22 am
Apple researchers conducted a systematic study of the computational bottlenecks and cost-efficiency of training SLMs. Their work evaluates training strategies across diverse cloud infrastructure setups, offering practical insights for improving efficiency and reducing costs. The post […]
- Redefines Consistency Models”: OpenAI’s TrigFlow Narrows FID Gap to 10% with Efficient Two-Step Samplingby Synced on November 27, 2024 at 2:20 am
OpenAI researchers introduces TrigFlow, a simplified theoretical framework that identifies the key causes of training instability of consistency models and addresses them with novel improvements in diffusion process parameterization, network architecture, and training objectives. The post Redefines […]
- Precision in Pixels: NVIDIA’s Edify Image Model Combines High Quality with Unmatched Controlby Synced on November 26, 2024 at 12:18 am
In a new paper Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models, an NVIDIA research team introduces Edify Image—a suite of pixel-based diffusion models that achieve high-resolution image synthesis with exceptional control and precision. The post Precision in […]
- Amazon's AI Shopping Guides helps you research less and shop more. Here's how it workson December 20, 2024 at 8:34 pm
The sheer abundance of deals during the holiday season can get overwhelming. Amazon's guides help US customers navigate more than 100 product types.
- OpenAI unveils its most advanced o3 reasoning model on its last day of 'shipmas'on December 20, 2024 at 7:29 pm
For 12 days straight, OpenAI unveiled 'new things, big and small.' Here's what's new today and a full round-up of all the announcements.
- The last day of '12 days of OpenAI' is expected to bring biggest drop yeton December 20, 2024 at 5:04 pm
If you thought the o1 reasoning models were impressive, you may be in for a treat.
- This free AI training from IBM could boost your resume in 10 hourson December 20, 2024 at 2:08 pm
I spent a weekend earning my digital credential in AI from IBM. The last session was my favorite.
- Your Instagram videos will never be the same after these AI editing tools roll outon December 19, 2024 at 8:58 pm
With a simple text prompt, creators will be able to change outfits, backgrounds, and more. (Meta's Mosseri turned himself into a puppet.)
- Anthropic's Claude 3 Opus disobeyed its creators - but not for the reasons you're thinkingon December 19, 2024 at 8:15 pm
Anthropic found its model can take 'anti-Anthropic' action and trick training processes - a 'serious question' for safety.
- 3 holiday email scams to watch for - and how to stay safeon December 19, 2024 at 6:13 pm
Some of the messages in your Gmail inbox this season are not very nice. Google provides guidance on protecting yourself from the naughty ones.
- The window to apply for Perplexity's 2025 college AI program is closing - how to sign upon December 19, 2024 at 4:57 pm
The Perplexity Campus Strategist program provides a unique opportunity for students interested in AI, but you need to apply quickly.
- You can access the latest DALL-E 3 model for free, just not through ChatGPTon December 19, 2024 at 4:32 pm
Access OpenAI's most advanced image-generating model on Bing Image Creator for free.
- Agents are the 'third wave' of the AI revolutionon December 19, 2024 at 4:24 pm
How agentic AI is similar - and different - from its predecessor, generative AI.
- The 5 stages of digital twin developmenton December 19, 2024 at 4:16 pm
Digital twins are like flight simulators for business, but they're not as quick and easy to implement as you might think.
- No one wants another chatbot. This is the AI we actually needon December 19, 2024 at 3:43 pm
Fundamental advancements are still needed to turn today's chatbots into something more -- something that can sense when we're stressed or overwhelmed, not just when we need another PDF summarized.
- IBM's new enterprise AI models are more powerful than anything from OpenAI or Googleon December 19, 2024 at 1:29 pm
Bigger, better, and all open-source AI for enterprises: IBM releases its Granite 3.1 Large Language Models.
- How to use ChatGPT to summarize a book, article, or research paperon December 18, 2024 at 9:49 pm
Faced with a long document or a dense book? Here's how ChatGPT can help summarize the key points.
- Gemini Advanced users can now access Google's most experimental modelon December 18, 2024 at 9:45 pm
If you need help with coding, math, and reasoning, Gemini 2.0 Flash is the model for you.
- AI software startups set to take over $12 trillion US services industryon December 18, 2024 at 4:49 pm
Areas resistant to automation - like legal services and healthcare - are attracting novel applications that could even displace human workers, according to a Bank of America report.
- The open-source tools that could disrupt the entire IT incident management marketon December 18, 2024 at 2:49 pm
Open-source tools like Grafana Labs and AI-driven AIOps are shaking up incident management, challenging PagerDuty and streamlining IT problem-solving and code fixes. Here's why it matters.
- This hidden Apple feature turns your iPhone or iPad into an AI image generatoron December 18, 2024 at 2:12 pm
With Image Playground, you can generate images based on themes and other concepts, your own descriptions, and photos from your device's library.
- The top mobile AI features that Apple and Samsung owners actually useon December 18, 2024 at 1:19 pm
And why some users are avoiding the latest AI features on their phones.
- This agentic AI platform claims to speed development from 'months to days'on December 18, 2024 at 11:00 am
Blitzy claims its agents can optimize any model for reasoning, all while eliminating errors. Here's how it works, and where it might fall short.