Category: Top News

  • xAI Shipped Its Latest Model, Grok-4.3, with 1M Token Context Window and Low Price API

    xAI Shipped Its Latest Model, Grok-4.3, with 1M Token Context Window and Low Price API

    IBL News | New York

    Elon Musk’s xAI shipped its latest LLM, Grok 4.3, marking a leap in performance over its competitors OpenAI, Anthropic, Google, and Chinese firms DeepSeek, Moonshot (Kimi), Alibaba (Qwen), z.ai, and others — despite still remaining below the latest models from OpenAI and Anthropic.

    In addition to its freewheeling personality and image generation policy, the marquee feature is the low price point for developers via the API, at $1.25 per million input tokens and $2.50 per million output tokens, compared to its direct predecessor Grok 4.2’s initial API pricing of $2/$6 per million input/output tokens.

    According to the company, Grok 4.3 is built with reasoning as an active, permanent state. It means that the model is designed to “think” before it speaks for every query, a strategy intended to maximize factual accuracy and handle complex, multi-step instructions.

    The model’s memory is equally expansive, featuring a 1 million-token context window — roughly equivalent to several thick novels.

    It is specifically optimized for agentic workflows—scenarios where an AI is not just answering a question but acting as an autonomous agent to complete a task.

    The model can design 9-slide PowerPoint decks, utilizing a “Sandwich Structure” (dark titles/conclusions with light content) and integrating data-driven decision matrices and humor.

    xAI also introduced Custom Voices, a sophisticated voice-cloning API and web-based voice cloning creation suite.

    This product allows users to clone their voice at high quality in a minute or two. Once cloned, the “voice ID” can be used across xAI’s Text-to-Speech (TTS) and Voice Agent APIs.

    Access to the new Voice Agent API is billed at a flat rate of $3.00 per hour ($0.05 per minute) for speech-to-speech interactions.

  • Owned AI Infrastructure and Data Sovereignty Emerge as Dominant Themes Across Industries

    Owned AI Infrastructure and Data Sovereignty Emerge as Dominant Themes Across Industries

    IBL News, Boston

    “Data quality, not model size, is the primary bottleneck in AI performance,” said Datology’s CEO, Ari Morcos, at the ODSC AI East 2026 in Boston this week. “Better training data and smaller models outperform larger ones trained on slop,” he explained.

    Ami Bhatt, FDA Chief Innovation Officer and Chair of the American College of Cardiology, discussed AI in clinical decision support and the FDA’s evolving framework for validating AI systems in healthcare. “At the FDA, we are building regulatory infrastructure for AI — not blocking it, but demanding rigor.”

    Across healthcare, finance, government, legal, education, manufacturing, and energy, a pattern emerged: the organizations moving fastest on AI are the ones that have solved the data privacy and deployment ownership equation, performing at frontier quality while keeping patient data locked down. They’re not asking “which model” but “where does it run, who owns the data, and can we audit every decision?”

    By industry sector, the ODSC East 2026 generated several outcomes:

    In Healthcare and Biopharma, AI in drug discovery dominated. Generative AI for molecular design is now producing novel candidate compounds in hours instead of months. Multiple sessions covered AI-driven biomarker discovery — using foundation models to identify disease signatures in genomic data that traditional bioinformatics pipelines miss entirely.

    At major health systems, medical imaging AI has evolved into predictive modeling for clinical outcomes.

    Healthcare sessions circled back to the issue of data privacy, as foundation models require massive datasets to perform, but HIPAA, patient consent, and institutional data governance impose hard constraints on data sharing. Synthetic data generation and federated learning emerged as the most discussed workarounds, but neither is mature enough for enterprise-scale deployment yet.

    In Government and Defense, federal agencies are moving from “should we use AI?” to “how do we deploy AI in air-gapped, classified, and compliance-heavy environments?” FedRAMP, NIST 800-53, and ITAR requirements dominated the conversation.

    The agencies that are moving fastest — DOD, intelligence community, DHS — are the ones with the most acute operational pain points and the budget to solve them.

    Google’s signing a classified AI deal with the Pentagon (reported during the conference week) added urgency to the discussion. The open question was whether the government AI infrastructure should be owned by hyperscalers or if agencies would build sovereign capability.

    An important takeaway was that only American-made models and compliance-first would have a massive moat.

    In Financial Services, a recurring theme was zero tolerance for AI hallucinations, as they could be a compliance violation. Multiple sessions addressed guardrails, output validation, and human-in-the-loop architectures specifically designed for regulated environments.

    Another takeaway is that the shift to open source in finance was real, and self-hosted, private deployment is becoming the default architecture. Several financial services practitioners described their institutions moving from proprietary APIs (OpenAI, Anthropic) to self-hosted open-weight models (Llama, DeepSeek, Mistral), driven not by cost but by data sovereignty.

    When your trading strategies, M&A documents, and client portfolios are the data, sending them to a third-party API is a non-starter.

    Financial services want frontier-quality AI but absolutely cannot accept the data exposure inherent in shared platforms.

    In Manufacturing and Supply Chain, AI for robotics reflected the convergence of foundation models with physical systems. Sessions covered reinforcement learning for warehouse automation, computer vision for quality inspection, and multi-agent coordination for logistics.

    The manufacturing story is about integration, not intelligence. The models are capable enough — the bottleneck is connecting AI to legacy ERP systems, SCADA networks, and supply chain databases that were built decades before APIs existed. Several sessions addressed the “last mile” problem of getting AI outputs into SAP, Oracle, and custom MES systems.

    Participants agreed that manufacturing AI is a data integration challenge first and a model challenge second. The organizations winning here are the ones who’ve invested in data infrastructure, not just model training.

    In Legal and Compliance, the recurring pattern was that law firms and in-house legal teams were deploying LLMs for document review, contract analysis, and legal research, but with extreme caution.

    Attorney-client privilege is the hard constraint. Unlike other industries where data privacy is a regulatory concern, in legal it’s a constitutional one. Multiple speakers from regulated industries described building air-gapped AI systems specifically so privileged communications never touch external infrastructure. The phrase “private deployment” came up more in legal-adjacent sessions than anywhere else.

    AI governance frameworks — as Shoshana Rosenberg presented — are being adopted fastest by legal departments, not IT departments. Lawyers understand regulatory risk intuitively and are building the policies and controls that other functions are still debating.

    Regarding Energy, Utilities, and Infrastructure companies, practitioners highlighted the need for AI that can be deployed reliably and deterministically on local hardware, without cloud dependencies, given that operations often occur on factory floors, in substations, and on drilling platforms, where connectivity is unreliable or prohibited.

    In terms of Software Engineering and Devtools, developers at the conference analyzed how to evaluate whether AI-written code is correct, secure, and maintainable. Evaluation systems, test generation, and multi-agent code review (in which multiple AI agents check each other’s work) were the most-discussed engineering patterns.

    Karen Zhou from Anthropic’s Claude Code team and Robert Brennan from All Hands AI (OpenHands/OpenDevin) enlightened the discussion.

    Software developers mostly agreed that competitive edge was in AI systems that can reason across codebases, call external services, and operate production infrastructure — not just autocomplete.

    Finally, in Higher Education and Research, Brown University’s Michael Littman, Boston University’s Mohammad Soltanieh-ha, Bentley University’s Noah Giansiracusa, and MIT’s Max Tegmark delivered major sessions. The academic-to-production pipeline has never been shorter.

    Many institutions are now building their own AI operating systems — course-specific agents, research assistants, administrative automation tools, and student support chatbots. Multiple sessions referenced institutions deploying self-hosted LLMs on their own cloud infrastructure to maintain FERPA compliance and student data privacy.

    One takeaway was that institutions that own their AI infrastructure, rather than subscribing to shared platforms, would emerge as the leaders.

     

  • Practitioners at the ODSC Event Examined Why AI Enterprise Projects Failed and Analyzed Tectonic Shifts

    Practitioners at the ODSC Event Examined Why AI Enterprise Projects Failed and Analyzed Tectonic Shifts

    Mikel Amigot, IBL News | Boston

    Around 3,500 AI practitioners (mostly data scientists, engineers, researchers, and business leaders) are attending the 11th ODSC (Open Data Science Conference) East conference this week in Boston, with a dominant theme: it’s time to execute Agentic AI across industries.

    With 300+ hours of expert-led content, 250+ speakers, and 15+ dedicated tracks, participants shared insights on the race to build the infrastructure that will define who controls AI in production.

    The conference has surfaced three tectonic shifts:

    1. Agents are replacing chatbots — not as a trend, but as a deployment pattern. Organizations that are still building Q&A bots are already behind.
    2. Governance is infrastructure, not paperwork — the organizations that build technical governance (audit trails, verification, guardrails) will move faster than those that skip it.
    3. The model is commoditizing; the stack is the moat — with 8 frontier models shipping in a single week and open-source catching up to proprietary, the competitive advantage has shifted from “which model” to “what infrastructure do you own.”

    ODSC dedicated an entire track to Agentic AI & Workflow Automation for the first time. Gartner’s prediction — 40% of enterprise apps will embed AI agents by the end of 2026 (up from 5% today) — was quoted in several keynotes.

    • MIT’s Max Tegmark challenged the room to move beyond “vibe coding” toward provably correct AI systems.
    • Pedro Domingos introduced “tensor logic” as a unifying language for AI.
    • Nouha Dziri (Cohere Labs) argued that hallucination mitigation requires architectural changes, not just better prompting.
    • Olivia Buzek from IBM ran a pre-conference workshop on building responsible AI agents with open-source tools.

    Rehgan Bleile (AlignAI) broke down why enterprise AI keeps failing to scale: Organizations invest in models and platforms but don’t invest in the organizational change management, incentive alignment, and cross-functional governance required to actually operationalize AI. Her prescription: treat AI deployment like an organizational redesign, not a technology upgrade.

    Adam Tauman Kalai (OpenAI) delivered a talk titled “Why Language Models Hallucinate,” notable because it came from inside OpenAI itself. Kalai explained the mathematical reasons behind hallucination as a phenomenon, positioning it not as a bug to be fixed but as an inherent property of probabilistic generation that needs to be managed through system design.

    Governance went mainstream. Shoshana Rosenberg (Women in AI Governance / Logical AI Governance) made the case that AI governance has moved from a compliance checkbox to a strategic competitive advantage. Her session on building future-ready governance frameworks was one of the most attended in the leadership track.

    Shoshana Rosenberg explained that organizations that build governance infrastructure now — not just policies, but actual technical controls, audit trails, and decision frameworks — will move faster in the long run than those who skip it to ship faster today.

  • Google Made its Veo Video Model Available for Advertisers

    Google Made its Veo Video Model Available for Advertisers

    IBL News | New York

    Google made its Veo video model available for marketers using Ads this month.

    The company’s goal is to make the video-creative process accessible without requiring a dedicated production budget, time, or expertise.

    Essentially, it lets advertisers upload and turn three static images into 10-second-long videos with natural motion, designed specifically for YouTube.

    Users upload images to Google Ads’ Asset Studio, which turn into ready-to-serve ads within minutes using customizable templates.

    Combined with Nano Banana, advertisers can swap backgrounds, adjust messages, and tailor content to audience interests.

    This is an example created on LinkedIn.

     

  • Google Added New Features to ‘Vids’, Its Video Editor App

    Google Added New Features to ‘Vids’, Its Video Editor App

    IBL News | New York

    Google added new features to its video editor app, Vids, including the ability to use natural language prompts to customize avatars and have them interact with products or equipment. Based on the video’s theme, users can tweak characters’ appearance, change their apparel, and create new backgrounds using prompts.

    Vids supports adding the Veo 3.1 video-generation model, which can create eight-second clips within the video editing tool. It also supports Lyria 3 and Lyria 3 Pro for adding music and sound effects to clips.

    Google is giving out 10 free video generations per month. The company said Google AI Ultra and Workspace AI Ultra accounts can generate up to 1,000 Veo videos per month.

    The tool allows users to export finished videos directly to YouTube, saving the hassle of downloading and uploading them to their channel.

    The video suite is also adding a new screen-recording Chrome extension that allows users to capture the screen with audio or video.

    In February, the company added 2D and 3D cartoon-style avatars and added language support for seven new voice-over languages, including French, German, Italian, Korean, Portuguese, Spanish, and Japanese.

  • Anthropic Adds More Free Self-Study Courses for Developers and Educators

    Anthropic Adds More Free Self-Study Courses for Developers and Educators

    IBL News | New York

    Anthropic introduced new free self-study courses for developers, students, educators, and AI enthusiasts at its Skilljar-based training portal.

    This online platform offers courses on key topics, including the use of the Claude AI-based assistant for daily work activities, such as Claude 101.

    Other key training paths include Claude Code in Action and Building with the Claude API.

    These classes explain how to integrate the Claude AI-based system into development environments and to use context-driven tools and the Model Context Protocol (MCP) to connect AI systems with external software systems.

    The Anthropic learning platform also offers AI Fluency-based training paths, enabling users to collaborate with AI systems and plan their academic and professional lives.

    Upon completion of the training programs, users can obtain certification to acknowledge their skills.

    Users would need a Skilljar account to have access to the course content.

  • DeepSeek Issued Its Open Source Model V4 Preview, with a Cost-Effective 1M Context Length

    DeepSeek Issued Its Open Source Model V4 Preview, with a Cost-Effective 1M Context Length

    IBL News | New York

    Chinese AI startup DeepSeek launched two preview versions of its newest large language model yesterday: DeepSeek V4 Flash and V4 Pro, with context windows of 1 million tokens each — enough to include large codebases or documents in prompts.

    DeepSeek V4 Pro costs $1.74/1M input and $3.48/1M output tokens while V4 Flash costs $0.14/1M input and $0.28/1M output tokens, both the cheapest in their class.

    The company stated that it has almost “closed the gap” with current leading models, both open and closed, on reasoning benchmarks.

    The company claims its new V4-Pro-Max model outperforms its open-source peers across reasoning benchmarks, and outstrips OpenAI’s GPT-5.2 and Gemini 3.0 Pro on some tasks. In coding competition benchmarks, DeepSeek said both V4 models’ performance is “comparable to GPT-5.4.”

    The Pro model has a total of 1.6 trillion parameters (49 billion active), making it the largest open-weight model available, outstripping Moonshot AI’s Kimi K 2.6 (1.1 trillion), MiniMax’s M1 (456 billion), and more than double DeepSeek V3.2 (671 billion). The smaller, V4 Flash has 284 billion parameters (13 billion active).

    Notably, DeepSeek V4 is much more affordable than any frontier model available today.

    • The smaller V4 Flash model costs $0.14 per million input tokens and $0.28 per million output tokens, undercutting GPT-5.4 Nano, Gemini 3.1 Flash, GPT-5.4 Mini, and Claude Haiku 4.5.

    • The larger V4 Pro model, meanwhile, costs $0.145 per million input tokens and $3.48 per million output tokens, also undercutting Gemini 3.1 Pro, GPT-5.5, Claude Opus 4.7, and GPT-5.4.

    DeepSeek said both models are more efficient and performant than DeepSeek V3.2 due to architectural improvements.

    Both V4 Flash and V4 Pro support text only, unlike many of their closed-source peers, which support understanding and generating audio, video, and images.

    The launch of DeepSeek V4 Flash and V4 Pro comes a day after the U.S. accused China of stealing American AI companies’ IP on an industrial scale using thousands of proxy accounts. DeepSeek itself has been accused by Anthropic and OpenAI of “distilling,” essentially copying, their AI models.

  • OpenAI Rolled Out ChatGPT Images 2.0, competing with Google’s Nano Banana

    OpenAI Rolled Out ChatGPT Images 2.0, competing with Google’s Nano Banana

    IBL News | New York

    OpenAI rolled out this week ChatGPT Images 2.0, a new tool that allows users to search the web and create visual explainers based on uploaded files, reasoning through the structure of the image before generating it.

    According to the company, this image generator creates more sophisticated images, with improvements in its ability to follow instructions and preserve the details the user chooses.

    It creates up to 8 images at once with thinking enabled, all while maintaining the same characters, objects, and styles across scenes.

    OpenAI said this should make it easier to generate things like manga pages, a series of social graphics, or design plans for every room in a house.

    It generates images with a resolution of up to 2K and in a range of aspect ratios, from wider formats such as 3:1 to taller ones like 1:3.

    OpenAI first released ChatGPT Images last year and launched its latest major update in December, adding faster image generation and improved photo-editing capabilities.

    Since then, competition has only been getting stronger, with the arrival of tools like Google’s Nano Banana Pro and Microsoft’s MAI-Image-2.

    ChatGPT Images 2.0 is available to all ChatGPT and Codex users starting today.

  • Google Announces Its Enterprise Agent Platform and Unveils Powerful New AI Chips

    Google Announces Its Enterprise Agent Platform and Unveils Powerful New AI Chips

    IBL News | New York

    Google CEO Sundar Pichai announced yesterday at the opening of the Google Cloud Next conference in Las Vegas, the Gemini Enterprise Agent Platform.

    Google’s platform, delivered through the Gemini Enterprise app, is geared toward IT and technical teams and is intended for building and managing agents at scale. It’s the company’s answer to Amazon’s Bedrock AgentCore and to Microsoft Foundry.

    “It brings together the best of Vertex AI with transformational new features, including Agent Studio, Agent-to-Agent Orchestration, Agent Registry, Agent Identity, Agent Gateway, Agent Observability, and much more,” explained Thomas Kurian, CEO of Google Cloud.

    The Agent platform performs trigger-based processes, edits files without switching apps, has an Inbox for managing agent activity, and offers Skills to create shortcuts for repetitive tasks and Canvas to create and edit files.

    The underlying models are Google’s Gemini LLM, Nano Banana 2 image generator, and Anthropic’s Claude — with support for Claude Opus, Sonnet, and Haiku, including the new Opus 4.7 launched last week.

    At the same event, Alphabet Inc.’s Google Cloud unveiled its eighth generation of TPUs, including the TPU 8t for training and the TPU 8i for inference, which will be generally available later this year. These tensor processing units, or TPUs, are a homegrown chip that’s designed to take on NVIDIA and become a greater force in AI

    In addition, Google shared plans to turn the Chrome browser into an AI coworker for enterprise users at the workplace. It then uses AI to handle various tasks such as booking travel, entering data, scheduling meetings, and other related tasks in web-based work.

    Google suggested that infusing AI into Chrome could be used to input information into the CRM based on content in a Google Doc, compare vendor pricing across tabs, summarize a candidate’s portfolio before an interview, pull key data from a competitor’s product page, and more.

    These workflows will still require the physical user to manually review and confirm the AI’s input before any action.

    The idea is to help speed up these more tedious tasks, freeing people to focus on what Google calls “strategic work.”

  • More Criticism from Students and Faculty on Datmouth University and Anthropic Partnership

    More Criticism from Students and Faculty on Datmouth University and Anthropic Partnership

    IBL News | New York

    A new issue has added criticism to Dartmouth’s institution-wide partnership with Anthropic, signed last December, following complaints from students and faculty about copyright infringement.

    “A more pressing concern is Anthropic’s relationship with the Pentagon,” wrote a student at The Dartmouth college newspaper, echoing an extended view.

    The public rejection of Anthropic’s CEO’s use of its AI tools in fully autonomous defense systems didn’t erase the protests by students and staff.

    “Its primary model, Claude, still serves a key role in the Pentagon’s arsenal, as it has served as an integral part of Palantir’s Maven Smart System, which provides the Department of Defense with real-time targeting recommendations in the ongoing conflict against Iran.”

    The Wall Street Journal reported that Claude was involved in the 1,000 strikes at the beginning of the U.S. military campaign in Iran.

    In its conflict in Gaza, Israel Defense Forces used “Lavender,” an AI-powered target identification software that analyzed surveillance data to score a Palestinian’s likelihood of being a Hamas militant. This tool reportedly has a 10 percent false-positive rate, resulting in harm to civilians.