Products & AppsModels & ReleasesOpenAI’s updated image generator can now pull information from the webOpenAI's ChatGPT Images 2.0 now integrates web search into its image generation pipeline, enabling the model to retrieve real-time information and produce more contextually accurate visuals from single prompts. The update marks a shift toward multimodal reasoning in generative imagery.The Verge — AI·Apr 2169
Models & ReleasesProducts & AppsChatGPT’s new Images 2.0 model is surprisingly good at generating textOpenAI's Images 2.0 model has notably improved at rendering text within generated images, a long-standing weakness for visual AI systems. The capability jump signals meaningful progress in multimodal generation and raises questions about downstream applications in design and content creation.TechCrunch — AI·Apr 2169
Models & ReleasesProducts & AppsOpenAI Beefs Up ChatGPT's Image Generation ModelOpenAI released ChatGPT Images 2.0, showing measurable gains in detail and text rendering but remaining weak outside English. The upgrade signals incremental progress in multimodal generation, though language barriers limit its global utility.WIRED — AI·Apr 2165
Business & FundingOpinion & AnalysisSam Altman throws shade at Anthropic’s cyber model, Mythos: ‘fear-based marketing’Sam Altman publicly criticized Anthropic's Mythos cyber security model as relying on fear-based marketing rather than substantive capability claims. The jab reflects escalating competitive tensions between OpenAI and Anthropic over model positioning and credibility in the enterprise security space.TechCrunch — AI·Apr 2158
Tools & CodeProducts & AppsMozilla Used Anthropic’s Mythos to Find and Fix 151 Bugs in FirefoxMozilla's Firefox team used Anthropic's Claude to identify and patch 151 bugs, demonstrating practical AI-assisted security workflows. The team cautioned that while AI won't fundamentally reshape cybersecurity, developers face a near-term adjustment period integrating these tools into their processes.WIRED — AI·Apr 2165
Products & AppsTools & CodeGoogle launches Deep Research and Deep Research Max agents to automate complex researchGoogle DeepMind rolled out Deep Research Max, an autonomous research agent built on Gemini 3.1 Pro that integrates web and proprietary data sources. Developers can now plug in financial feeds and specialized data via the Model Context Protocol, expanding agent capabilities beyond public information.The Decoder·Apr 2173
ResearchGeneralization at the Edge of StabilityResearchers model neural network training as random dynamical systems converging to fractal attractors rather than fixed points, introducing 'sharpness dimension' to explain why chaotic optimization regimes improve generalization. The work bridges Lyapunov theory and deep learning, offering theoretical grounding for why large learning rates often outperform conservative training.arXiv cs.LG·Apr 2162
ResearchSafe Continual Reinforcement Learning in Non-stationary EnvironmentsResearchers tackle the intersection of safe and continual reinforcement learning, addressing a gap where RL systems must adapt to changing real-world dynamics while maintaining safety constraints throughout training and deployment. The work targets physical control systems where transient safety violations during learning are unacceptable.arXiv cs.LG·Apr 2152
ResearchFASTER: Value-Guided Sampling for Fast RLResearchers propose FASTER, a technique that cuts computational cost of sampling-based RL policies by modeling action filtering as an MDP, enabling value-guided early termination during diffusion denoising rather than waiting for full generation.arXiv cs.LG·Apr 2158
ResearchFB-NLL: A Feature-Based Approach to Tackle Noisy Labels in Personalized Federated LearningResearchers propose FB-NLL, a federated learning framework that improves personalization across distributed devices by clustering users through feature-space analysis rather than training dynamics, making the system more robust to corrupted data and mislabeled examples.arXiv cs.LG·Apr 2152
Tools & CodeResearchVLA Foundry: A Unified Framework for Training Vision-Language-Action ModelsVLA Foundry unifies language, vision, and action model training in a single open-source codebase, eliminating the fragmented pipeline problem that has plagued prior robotics-focused AI efforts. The team released two model variants and benchmarked them on an open simulator, offering practitioners an end-to-end training stack from scratch or via pretrained backbones.arXiv cs.LG·Apr 2158
ResearchBenign Overfitting in Adversarial Training for Vision TransformersResearchers provide the first theoretical framework showing Vision Transformers can achieve robust generalization under adversarial training within specific signal-to-noise and perturbation conditions, resolving a gap between ViT empirical robustness and formal understanding.arXiv cs.LG·Apr 2158
ResearchAdaptive MSD-Splitting: Enhancing C4.5 and Random Forests for Skewed Continuous AttributesResearchers propose Adaptive MSD-Splitting, an improvement to the MSD-Splitting discretization technique for decision trees that dynamically adjusts binning thresholds to handle skewed data distributions. The method addresses a key limitation of the original approach, which struggled with real-world biomedical and financial datasets where asymmetry causes information loss.arXiv cs.LG·Apr 2142
ResearchDiscovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Language and Symbolic ViewsResearchers discovered that LLMs maintain a shared internal logical subspace bridging natural-language and symbolic reasoning, using Canonical Correlation Analysis to extract a low-dimensional representation that captures reasoning independent of surface form. This finding suggests LLMs don't need external symbolic solvers and could improve multi-step logical reasoning through better alignment of these dual views.arXiv cs.CL·Apr 2162
Business & FundingTools & CodeHyundai, DeepX Partner to Develop AI Platform for RoboticsHyundai and DeepX are building foundational AI infrastructure for next-generation robotics, targeting the core systems that will power autonomous machines at scale. The partnership signals automotive and robotics players moving beyond off-the-shelf models to develop proprietary AI stacks.AI Business·Apr 2155
Products & AppsPolicy & RegulationCelebrities will be able to find and request removal of AI deepfakes on YouTubeYouTube is rolling out likeness detection to celebrities, letting public figures search for and flag AI deepfakes of themselves on the platform. The feature automates what was previously manual takedown work, shifting enforcement burden onto talent themselves.The Verge — AI·Apr 2165
Policy & RegulationBusiness & FundingBuilding agent-first governance and securityAs AI agents proliferate in enterprises, security gaps are widening: non-human identities now outnumber human ones at some firms, creating new vectors for data theft and system compromise. Governance frameworks lag behind deployment, leaving organizations exposed to agent manipulation attacks.MIT Technology Review — AI·Apr 2177
ResearchEpistemic orientation in parliamentary discourse is associated with deliberative democracyResearchers developed an LLM-based metric to quantify whether parliamentary speech leans toward evidence or intuition, then applied it to 15 million speeches across seven countries since 1946. The analysis reveals correlations between evidence-based discourse and stronger democratic institutions, offering a scalable method for measuring epistemic quality in political communication.arXiv cs.CL·Apr 2158
ResearchPlanning in entropy-regularized Markov decision processes and gamesResearchers introduce SmoothCruiser, a planning algorithm that solves entropy-regularized MDPs and two-player games with polynomial sample complexity O(1/epsilon^4), addressing a gap where non-regularized settings lack worst-case guarantees.arXiv cs.LG·Apr 2152
Policy & RegulationBusiness & FundingClarifai deletes 3 million photos that OkCupid provided to train facial recognition AI, report saysClarifai deleted 3 million photos that OkCupid provided for facial recognition training, following an FTC settlement. The 2014 data-sharing arrangement between the dating app and the AI company—whose executives had financial ties to OkCupid—now faces regulatory consequences over undisclosed training practices.TechCrunch — AI·Apr 2165
ResearchAn Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QAResearchers introduce a new task and dataset for improving QA systems beyond single-answer retrieval. SCOpE-QA contains 3,000 open-ended questions designed to train models that generate follow-up insights, enabling iterative refinement of answers rather than static responses.arXiv cs.CL·Apr 2152
ResearchPREF-XAI: Preference-Based Personalized Rule Explanations of Black-Box Machine Learning ModelsResearchers propose PREF-XAI, a framework that tailors model explanations to individual user preferences rather than applying one-size-fits-all interpretability methods. The approach treats explanation generation as a preference-learning problem, addressing a gap in XAI where cognitive constraints and user goals vary widely.arXiv cs.LG·Apr 2152
ResearchExploring Language-Agnosticity in Function Vectors: A Case Study in Machine TranslationResearchers found that function vectors—task representations extracted from multilingual LLMs during in-context learning—transfer across languages when trained on a single translation direction. Translation vectors learned from English-to-one-language pairs improved token ranking in unseen target languages, suggesting language-agnostic task encoding in decoder-only models.arXiv cs.CL·Apr 2152
ResearchLearning Hybrid-Control Policies for High-Precision In-Contact Manipulation Under UncertaintyResearchers propose hybrid position-force control policies that let reinforcement learning agents dynamically switch between force and position control for delicate manipulation tasks like connector insertion. A new training method called MATCH improves learning efficiency by handling contact mode transitions.arXiv cs.LG·Apr 2152
ResearchBudgeted Online Influence MaximizationResearchers propose a budget-constrained algorithm for selecting influencers in social ad campaigns, replacing traditional cardinality limits with real-world cost modeling. The approach improves regret bounds for both budget and cardinality settings under cascade diffusion models with semi-bandit feedback.arXiv cs.LG·Apr 2142
ResearchHardNet++: Nonlinear Constraint Enforcement in Neural NetworksHardNet++ enforces both linear and nonlinear constraints on neural network outputs during inference, addressing a gap in existing methods that either lack guarantees or work only for specific constraint types. The technique matters for safety-critical applications like control systems and autonomous decision-making where constraint violations carry real costs.arXiv cs.LG·Apr 2152
ResearchTools & CodeChat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural LanguageResearchers introduced Chat2Workflow, a benchmark and agentic framework for converting natural language into executable visual workflows, addressing the manual engineering bottleneck in industrial automation. The work tests whether LLMs can automate multi-step workflow design and error correction without human intervention.arXiv cs.CL·Apr 2158
ResearchFrom Top-1 to Top-K: A Reproducibility Study and Benchmarking of Counterfactual Explanations for Recommender SystemsResearchers unified evaluation of eleven counterfactual explanation methods for recommender systems, addressing fragmentation across datasets, metrics, and protocols that previously blocked fair comparison. The benchmarking framework assesses explainers across three dimensions, covering both native methods like LIME-RS and SHAP plus graph neural network approaches.arXiv cs.LG·Apr 2152
ResearchDisentangling Damage from Operational Variability: A Label-Free Self-Supervised Representation Learning Framework for Output-Only Structural Damage IdentificationResearchers propose a self-supervised learning framework using disentangled representations to identify structural damage from vibration signals while filtering out environmental noise. The approach uses an autoencoder with VICReg regularization to separate damage-induced changes from operational variability, addressing a key challenge in structural health monitoring.arXiv cs.LG·Apr 2142
ResearchPause or Fabricate? Training Language Models for Grounded ReasoningResearchers propose GRIL, a reinforcement learning framework that trains language models to recognize when they lack sufficient information for reliable inference, rather than confidently fabricating answers. The approach decomposes reasoning into clarification and pause stages, addressing a fundamental failure mode in LLM reasoning under incomplete data.arXiv cs.CL·Apr 2158