Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.
| Episode | Date |
|---|---|
|
Claude Fable 5 Isn’t Just a Better Model: It’s a New AI Runtime
|
Jun 10, 2026 |
|
The EML Operator: One Primitive to Rule All Mathematics
|
May 13, 2026 |
|
OpenAI MRC, SRv6, and the Architecture of Frontier AI Supercomputers
|
May 08, 2026 |
|
Inside the Machine: Training GPT-5, the Memory Wall, and the Math of MoE
|
May 01, 2026 |
|
DeepSeek-V4: The Million-Token Efficiency Leap | Open Source SOTA
|
Apr 27, 2026 |
|
Breaking the Quadratic Bottleneck with DeepSeek-V4’s Hybrid Attention
|
Apr 27, 2026 |
|
Claude Desktop’s Silent Sandbox Bypass: The Undocumented Browser Bridge
|
Apr 24, 2026 |
|
Forensic Audit of Anthropic’s Native Messaging Backdoor
|
Apr 24, 2026 |
|
The $60 Billion Synergy: Architecting the SpaceX + Cursor AI "Colossus" | Neural Intel Podcast
|
Apr 24, 2026 |
|
The Jackrong Playbook: Mastering Claude 4.6 Opus Distillation with Unsloth and LoRA
|
Apr 20, 2026 |
|
Inside the Claude Opus 4.7 Orchestration Layer - Deferred Tools & Agentic Code
|
Apr 17, 2026 |
|
Electrons to Tokens: The Technical Architecture of Nvidia’s AI Monopoly
|
Apr 16, 2026 |
|
Hermes Agent’s Memory Architecture and the Future of Agentic RL
|
Apr 14, 2026 |
|
200 Gigawatts or Bust: Dylan Patel on the Engineering Reality of AGI Scaling
|
Apr 12, 2026 |
|
The Muse Spark Revolution: Dissecting Meta's 2026 Architectural Pivot & The Triad of Truth | Neural Intel Podcast
|
Apr 09, 2026 |
|
Synaptic Persistence and Mushroom Body Neurogenesis: The Architecture of Metamorphic Memory
|
Apr 09, 2026 |
|
Engineering Sovereign Knowledge Bases with Andrej Karpathy’s Automated Architect
|
Apr 07, 2026 |
|
The Mercor AI Breach: National Security Crisis or a Wake-Up Call for the AI Industry?
|
Apr 03, 2026 |
|
BREAKING: Massive Mercor AI Data Breach - SOTA Training Data Leaked from Meta, Apple, & Amazon
|
Apr 03, 2026 |
|
Did Anthropic Just Hand the Keys to AI Coding to Everyone? The Huge Claude Code Leak Explained
|
Apr 02, 2026 |
|
The Claude Code Leak: Decoding Anthropic’s Self-Healing Memory and Secret "KAIROS" Agent
|
Apr 02, 2026 |
|
Is AI Censorship Over? The G0DM0D3 "Liberated Chat" Breakthrough
|
Mar 29, 2026 |
|
Is Traditional Computing Dead? NVIDIA's Jensen Huang on the "iPhone of Tokens"
|
Mar 26, 2026 |
|
The Bio-Computer Architecture: Declassified CIA Mechanics for Synthetic Consciousness
|
Mar 25, 2026 |
|
The End of the Human Bottleneck: Andrej Karpathy on Auto-Research and Recursive AI
|
Mar 24, 2026 |
|
Is Open Source Dead? Inside the Cursor Composer 2 vs. Kimi License Controversy
|
Mar 22, 2026 |
|
Is Residual Scaling Obsolete? Introducing Attention Residuals
|
Mar 17, 2026 |
|
The Sequence-Depth Breakthrough: Inside Kimi Team's Attention Residuals
|
Mar 16, 2026 |
|
Beyond the Prompt: Architecture of the Qwen-Agent Ecosystem and Qwen3.5
|
Mar 12, 2026 |
|
Beyond the Chatbot: Engineering "Forever-Agents" with Hermes Agent and OpenClaw
|
Mar 10, 2026 |
|
Nanochat: How Karpathy Automated AI Evolution with NVIDIA ClimbMix
|
Mar 08, 2026 |
|
1 Million Tokens: Breakthrough or Marketing Stunt? The GPT-5.4 Technical Deep Dive
|
Mar 06, 2026 |
|
Qwen 3.5: Exodus, Restructuring, Betrayal, and the Future of Chinese AI
|
Mar 04, 2026 |
|
The Mac mini Guide to OpenClaw and Local AI
|
Mar 02, 2026 |
|
The Neural Intel Op Ed: Engineering a Post-Natural Language for the AI Era
|
Mar 01, 2026 |
|
Andrej Karpathy on the "Claw" Revolution: Are AI Agents Obsolete?
|
Feb 28, 2026 |
|
10 Million Tokens and Beyond: Why Recursive AI is the Next Scaling Frontier
|
Feb 21, 2026 |
|
The Grok 4.20 Manifesto: Multi-Agent Logic and the Quest for Unfiltered Truth
|
Feb 18, 2026 |
|
The End of Memory Bottlenecks: How Fiber Optics and Ganged Flash Power Trillion-Parameter Models
|
Feb 16, 2026 |
|
Interview with Dario Amodei from Anthropic: Inside the $100B "Big Blob of Compute" & The 2030 AGI Certainty
|
Feb 15, 2026 |
|
The OpenClaw Saga: Peter Steinberger on Self-Modifying AI and the Age of the Lobster
|
Feb 15, 2026 |
|
Inside the 180 Billion HKD Breakthrough: How MiniMax M2.5 Scaled Agentic RL
|
Feb 14, 2026 |
|
The 744B Parameter Giant: How GLM-5 and Domestic Chips Redefine the Global AI Order
|
Feb 12, 2026 |
|
The OpenClaw Security Crisis: Can We Control Autonomous AI Swarms?
|
Feb 04, 2026 |
|
Is Consciousness Only in Your Head?
|
Jan 29, 2026 |
|
Methods and Applications of Parametric Sensitivity Analysis
|
Jan 22, 2026 |
|
The Architecture of Choice: Scaling MIT’s Decision Algorithms
|
Jan 19, 2026 |
|
The Logographic Advantage: How China’s Ancient Language is Powering Next-Gen AI | Neural Intel Deep Dive
|
Jan 09, 2026 |
|
Deep Learning Deep Dive: From Neural Networks to Differentiable Programming
|
Jan 07, 2026 |
|
The Hidden Evolution: Implicit Reinforcement Learning and the Future of Iterative AI
|
Jan 05, 2026 |
|
The Math of Stability: DeepSeek-AI’s mHC and the Evolution of Macro-Architecture
|
Jan 01, 2026 |
|
MoE Giants: Decoding the 670 Billion Parameter Showdown Between DeepSeek V3 and Mistral Large
|
Dec 25, 2025 |
|
GLM-4.7 Deep Dive: 358B Parameters, Agentic Reasoning, and the Future of Open Weights
|
Dec 24, 2025 |
|
Beyond the Exam Room: Stress-Testing Clinical AI with Medmarks v0.1
|
Dec 23, 2025 |
|
ANDREJ KARPATHY 2025 LLM Review: RLVR, Jagged Intelligence, & The Vibe Coding Revolution
|
Dec 21, 2025 |
|
The Automated Karpathy Recipe: Master Neural Network Debugging with neural_net_checklist
|
Dec 18, 2025 |
|
Nemotron 3 Nano: The Hybrid Mamba-MoE Model Driving Efficient, 1M-Token Agentic AI
|
Dec 16, 2025 |
|
Olmo 3: Unpacking the Fully Open LLM Flow (Dolma 3, OlmoRL, & State-of-the-Art Reasoning)
|
Dec 14, 2025 |
|
The Code Red Gambit: GPT-5.2's Mega-Agent Architecture
|
Dec 13, 2025 |
|
Fara-7B: The 7B Agentic SLM Redefining On-Device CUA Performance
|
Dec 10, 2025 |
|
The AGI Frontier: DeepMind’s Decade of Breakthroughs-From DQN and AlphaZero to Solving Protein Folding.
|
Dec 07, 2025 |
|
INTELLECT-3: Scaling Agentic RL and MoE to SOTA Performance with prime-rl and 512 H200s
|
Dec 04, 2025 |
|
Kimi Founder Yang Zhilin on K2, Agentic LLMs, & AGI: The Beginning of Infinity | Scaling & Innovation Strategy
|
Nov 30, 2025 |
|
Ilya Sutskever on AI: Transitioning from Scaling to Research, Generalization, and the Future of Superintelligence
|
Nov 26, 2025 |
|
Neuromorphic Computing: Principles and Architecture
|
Nov 23, 2025 |
|
Gemini 3 Pro Release Review: Benchmarks, Generative UI, Deep Think Mode, and Google Antigravity
|
Nov 20, 2025 |
|
DeepSeek-OCR: Contexts Optical Compression
|
Nov 16, 2025 |
|
LLM Gambling Addiction: Behavioral and Neural Mechanisms
|
Nov 10, 2025 |
|
Glyph: Visual-Text Compression for Scaling Context Windows
|
Nov 02, 2025 |
|
Continual Learning via Sparse Memory Finetuning
|
Oct 26, 2025 |
|
Andrej Karpathy on AI, Intelligence, and Education
|
Oct 21, 2025 |
|
Untangling the xAI-OpenAI Legal War: Trade Secrets and Antitrust
|
Oct 04, 2025 |
|
IBM Granite 4.0: Hybrid Mamba/Transformer Breakthrough for Enterprise LLMs?
|
Oct 03, 2025 |
|
Anthropic's Claude Sonnet 4.5: The New Coding Standard?
|
Sep 30, 2025 |
|
GPT-5-Codex: Agentic Coding and OpenAI's Evolution
|
Sep 22, 2025 |
|
Grok 4 Fast: Speed, Efficiency, and Application Review
|
Sep 22, 2025 |
|
How to Read a Research Paper
|
Sep 14, 2025 |
|
The Science of Sampling
|
Sep 14, 2025 |
|
GPT-5 Revisited: Progress, Performance, and User Experience
|
Sep 12, 2025 |
|
Thyme Autonomous AI that Sees, Codes and Solves Problems
|
Sep 11, 2025 |
|
YaRN: Extending LLM Context Windows Efficiently
|
Sep 10, 2025 |
|
Ilya Sutskever's AI Vision: From Deep Learning Dogmas to Safe Superintelligence
|
Sep 09, 2025 |
|
Thyme: Think Beyond Images with Code-Executing MLLMs
|
Sep 07, 2025 |
|
What did Ilya see?
|
Sep 06, 2025 |
|
Meta's AI Ambitions: Turbulence in Superintelligence Labs
|
Sep 05, 2025 |
|
Hierarchical Reasoning: Bigger Isn't Always Better
|
Sep 04, 2025 |
|
Prime Collective Communications Library: A Technical Report
|
Sep 03, 2025 |
|
Prime Collective Communications Library: A Technical Report
|
Sep 03, 2025 |
|
MetaStone-S1: Reflective Generative AI for Test-Time Scaling
|
Sep 02, 2025 |
|
MetaStone-S1: Reflective Generative AI for Test-Time Scaling
|
Sep 02, 2025 |
|
ToonComposer: AI-Assisted Cartoon Production and Post-Keyframing
|
Sep 01, 2025 |
|
ToonComposer: AI-Assisted Cartoon Production and Post-Keyframing
|
Sep 01, 2025 |
|
Triton: Language, Compiler, and Optimization for AI Workloads
|
Aug 31, 2025 |
|
Triton: Language, Compiler, and Optimization for AI Workloads
|
Aug 30, 2025 |
|
Dynamic Fine-Tuning: Elevating LLM Generalization
|
Aug 29, 2025 |
|
Lessons from a Chimp: AI Scheming and Ape Language
|
Aug 28, 2025 |
|
Deciphering Reinforcement Learning for Language Models
|
Aug 28, 2025 |
|
STREAM3R: Scalable Streaming 3D Reconstruction with Causal Transformer
|
Aug 28, 2025 |
|
Yan: Interactive Video Generation Framework
|
Aug 27, 2025 |
|
Lessons from a Chimp: AI Scheming and Ape Language
|
Aug 26, 2025 |
|
NextStep-1: Unified Multi-modal Generation
|
Aug 26, 2025 |
|
STREAM3R: Scalable Streaming 3D Reconstruction with Causal Transformer
|
Aug 24, 2025 |
|
GLM-V: Advancing Multimodal Reasoning with RLCS
|
Aug 23, 2025 |
|
DINOv3: Self-Supervised Vision Foundation Models
|
Aug 22, 2025 |
|
GPT-5 and Grok 4: Altman vs Musk
|
Aug 21, 2025 |
|
Hugging Face Hub Storage: Xet vs. Git LFS
|
Aug 20, 2025 |
|
Channel-Wise MLPs Boost RCN Generalization
|
Aug 19, 2025 |
|
Fine-Tuning Custom Embedding Models for Enhanced Retrieval Performance
|
Aug 18, 2025 |
|
AdLlama: Boosting Ad CTR with Reinforcement Learning
|
Aug 17, 2025 |
|
Machine Learning: Models, Algorithms, and Reinforcement Learning
|
Aug 17, 2025 |
|
Mixture-of-Recursions: Adaptive Computation for Language Models
|
Aug 16, 2025 |
|
Operator-Based Machine Intelligence: A Hilbert Space Framework
|
Aug 15, 2025 |
|
Meta CLIP 2: A Worldwide Scaling Recipe
|
Aug 13, 2025 |
|
In-Context Learning: Implicit Weight Dynamics
|
Aug 12, 2025 |
|
GLM-4.5: Open Agentic, Reasoning, and Coding Foundation Models
|
Aug 11, 2025 |
|
RLVMR: Verifiable Meta-Reasoning for Long-Horizon Agents
|
Aug 10, 2025 |
|
CoT-Self-Instruct: High-Quality Synthetic Prompt Generation
|
Aug 09, 2025 |
|
GPT-5: Hype, Reality and the Future of AI
|
Aug 09, 2025 |
|
Seed-Prover: Advancing Automated Mathematical Reasoning with Formal Verification
|
Aug 08, 2025 |
|
Self-Evolving Agents: A Comprehensive Survey
|
Aug 07, 2025 |
|
High-Precision W and Z Boson Mass Measurement at CMS
|
Aug 07, 2025 |
|
Falcon-H1: Hybrid-Head LLMs for Efficiency and Performance
|
Aug 06, 2025 |
|
ASI-ARCH: AI-Driven Scientific Discovery for Neural Architecture
|
Aug 04, 2025 |
|
In-Context Learning: Implicit Weight Dynamics
|
Aug 03, 2025 |
|
Qwen3: Unifying Reasoning and Efficiency in LLMs
|
Aug 02, 2025 |
|
Group Sequence Policy Optimization for LLMs
|
Aug 01, 2025 |
|
Reinforcement Learning: Advancements, Applications, and Challenges
|
Jul 31, 2025 |
|
SPIRAL: Self-Play for Reasoning in Games
|
Jul 29, 2025 |
|
Qwen3-Coder: Agentic Coding and Model Capabilities
|
Jul 28, 2025 |
|
Hierarchical Reasoning Model: Brain-Inspired AI for Complex Tasks
|
Jul 27, 2025 |
|
Local LLM Solutions for Mac Silicon: Llama.cpp and LM Studio
|
Jul 26, 2025 |
|
Kimi K2: Open Agentic Intelligence and Applications
|
Jul 25, 2025 |
|
CARTRIDGES: Efficient Context for LLMs
|
Jul 24, 2025 |
|
Prompt Baking: Embedding LLM Behavior in Weights
|
Jul 23, 2025 |
|
Massistant: Chinese Mobile Forensic Tooling Revealed
|
Jul 22, 2025 |
|
Unexpected Military Roots of Digital Computing and Research
|
Jul 17, 2025 |
|
The 2025 AI Landscape: Progress and Outlook
|
Jul 16, 2025 |
|
The Dynamics of Neural Attention
|
Jul 15, 2025 |
|
Consciousness and Reality according to the CIA:Gateway
|
Jul 14, 2025 |
|
Military Roots of Digital Computing and Research
|
Jul 13, 2025 |
|
Accelerating Mobile AI with ExecuTorch and KleidiAI: Revisited
|
Jul 12, 2025 |
|
State-Adaptive Regularization for Offline Reinforcement Learning
|
Jul 11, 2025 |
|
Nash Learning from Human Feedback via Mirror Prox
|
Jul 10, 2025 |
|
MiniMax-M1: Scaling Test-Time Compute with Lightning Attention
|
Jul 09, 2025 |
|
Direct Reasoning Optimization for LLMs
|
Jul 08, 2025 |
|
AI's Impact on the US Workforce
|
Jul 07, 2025 |
|
LLaMA Factory: Easy LLM Fine-Tuning
|
Jul 06, 2025 |
|
Project Vend: Can Claude Run a Small Shop?
|
Jul 05, 2025 |
|
Self-Adapting Language Models (SEAL)
|
Jul 04, 2025 |
|
The Illusion of the Illusion of Thinking
|
Jul 03, 2025 |
|
The Illusion of Thinking in Reasoning Models
|
Jul 02, 2025 |
|
Meta-Reinforcement Learning with Minimum Attention
|
Jul 01, 2025 |
|
AI Persuasion Through Reinforcement Learning and Rhetoric
|
Jun 30, 2025 |
|
Reinforcement Learning for Assembly Code Optimization with LLMs
|
Jun 30, 2025 |
|
FileFix: Browser to PowerShell Social Engineering
|
Jun 29, 2025 |
|
Reinforcement Learning Under Unmeasured Confounding
|
Jun 28, 2025 |
|
Reinforcement Learning for Urban Air Quality Management
|
Jun 27, 2025 |
|
Reinforcement Learning in Non-Stationary Environments
|
Jun 26, 2025 |
|
Personalized Policy Learning from Heterogeneous Data
|
Jun 25, 2025 |
|
Boosting Reinforcement Learning with Human Feedback via SeRA
|
Jun 23, 2025 |
|
AXIOM: Active Inference Object-Centric World Models
|
Jun 22, 2025 |
|
Entropy and Reinforcement Learning for LLMs
|
Jun 21, 2025 |
|
FLEX Robot-Agnostic Force-Based Manipulation Learning
|
Jun 19, 2025 |
|
Agent RL Scaling for Mathematical Problem Solving
|
Jun 18, 2025 |
|
Beyond Reward: Limits of RL in LLM Reasoning
|
Jun 17, 2025 |
|
Reward Model Variance in RLHF
|
Jun 15, 2025 |
|
Power Grid Topological Control with Graph Reinforcement Learning
|
Jun 14, 2025 |
|
Decentralized RL for Multi-Resource Allocation via Dynamic Cluster Agreements
|
Jun 13, 2025 |
|
Reinforcement Learning for Humanoid Dexterous Manipulation
|
Jun 12, 2025 |
|
µCODE: Code Generation with Single-Step Rewards
|
Jun 11, 2025 |
|
Confidence-Reward Preference Optimization for Machine Translation
|
Jun 10, 2025 |
|
Personalized Preference Learning with MiCRo
|
Jun 09, 2025 |
|
ProRL Expands LLM Reasoning Boundaries
|
Jun 08, 2025 |
|
ProxyThinker: Guiding Large Models with Small Reasoners
|
Jun 07, 2025 |
|
Open CaptchaWorld: Benchmarking MLLM Agents
|
Jun 07, 2025 |
|
DexMachina: Functional Dexterous Bimanual Manipulation
|
Jun 06, 2025 |
|
3DMEM-BENCH: Long-Term Memory for Embodied AI
|
Jun 05, 2025 |
|
Fine-Tuning Large Language Models: A Comprehensive Guide
|
Jun 04, 2025 |
|
Maximizing Confidence Alone Improves Reasoning
|
Jun 02, 2025 |
|
Critical Points of Random Neural Networks
|
Jun 01, 2025 |
|
BAGEL: Vision-Language Model for Visual Generation
|
May 31, 2025 |
|
Incentivizing Knowledge Acquisition in LLMs via RL
|
May 31, 2025 |
|
RL for Image Generation: DPO vs GRPO
|
May 30, 2025 |
|
Let Androids Dream Framework
|
May 29, 2025 |
|
SmolVLM: Compact and Efficient Vision-Language Models
|
May 27, 2025 |
|
Federated Learning: Privacy-Preserving Collaborative Intelligence Survey
|
May 26, 2025 |
|
Compressed Federated Learning of Tiny Language Models
|
May 25, 2025 |
|
Mobile Intelligence Language Understanding Benchmark
|
May 24, 2025 |
|
AI-RAN: Converging Communications and Computing
|
May 23, 2025 |
|
Ollama LLM Fine-Tuning Methods
|
May 22, 2025 |
|
Customizing LLMs for High-Performance VHDL Design
|
May 21, 2025 |
|
Adaptively Weighted Nearest Neighbors for Matrix Completion
|
May 20, 2025 |
|
SAD Neural Networks, Divergent Gradient Flows, and Optimality
|
May 19, 2025 |
|
WavReward: Evaluating Spoken Dialogue Models
|
May 18, 2025 |
|
BLIP3-o Unified Multimodal Models
|
May 17, 2025 |
|
CodePDE: LLM-Driven PDE Solver Generation
|
May 16, 2025 |
|
Online Learning Neural Networks: Bounds and Characterization
|
May 15, 2025 |
|
UAV Visual Object Search in City Space
|
May 15, 2025 |
|
Benchmark for Auto-bidding Task
|
May 14, 2025 |
|
Reinforcement Learning with Human Feedback Improvements
|
May 06, 2025 |
|
T2I-R1: Reinforcing Image Generation with Bi-level CoT
|
May 05, 2025 |
|
Pretraining for Heterogeneous Treatment Effects
|
May 04, 2025 |
|
AI Jekyll-Hyde Tipping Point Formula
|
May 04, 2025 |
|
Personalizing Multimodal Models with Yo'Chameleon
|
May 03, 2025 |
|
Current Advances and Applications of AI, April 2025 Overview
|
May 01, 2025 |
|
Min-Form Credit Assignment for Process Reward Model Reasoning
|
May 01, 2025 |
|
Language Models for Automated Patient Record Linkage
|
Apr 30, 2025 |
|
Parameter-Efficient Continual Learning: A Survey
|
Apr 29, 2025 |
|
Building an Agent: LLM, Loop, and Tokens
|
Apr 28, 2025 |
|
Uncertainty-Guided Lung Tumor Segmentation via Coarse-to-Fine Refinement
|
Apr 27, 2025 |
|
Complex Instruction-Based Image Editing Benchmark
|
Apr 26, 2025 |
|
Sleep-Time Compute: Pre-computation for Efficient LLM Inference
|
Apr 25, 2025 |
|
Miras: A Framework for Designing Deep Learning Architectures
|
Apr 24, 2025 |
|
RUKA: A Compact and Affordable Humanoid Robotic Hand
|
Apr 23, 2025 |
|
GenEAva: Expressive Cartoon Avatar Generation via Diffusion
|
Apr 22, 2025 |
|
VCR-Bench: Video Chain-of-Thought Reasoning Evaluation
|
Apr 21, 2025 |
|
Automating LLM Hallucination Detection with Reasoning
|
Apr 20, 2025 |
|
Llama 4: Natively Multimodal AI Innovation
|
Apr 19, 2025 |
|
Self-Steering Language Models via Probabilistic Programs
|
Apr 18, 2025 |
|
Amazon Q Developer: AI for Data Science in SageMaker Canvas
|
Apr 17, 2025 |
|
Adaptive SVD for Continual Learning in Large Language Models
|
Apr 16, 2025 |
|
Llama 4: Natively Multimodal AI Innovation
|
Apr 15, 2025 |
|
UniOcc: Unified Occupancy Prediction and Forecasting Benchmark
|
Apr 13, 2025 |
|
Graph Counterfactual XAI via Latent Space Traversal
|
Apr 12, 2025 |
|
Continual Forgetting for Pre-trained Vision Models
|
Apr 11, 2025 |
|
Age of Updates for Adaptive OFDM in Autonomous Vehicles
|
Apr 10, 2025 |
|
Video Generation Improvement via Human Preference Alignment
|
Apr 09, 2025 |
|
AnimeGamer: Infinite Anime Life Simulation via MLLM
|
Apr 08, 2025 |
|
NoProp: Learning Neural Networks Without Backpropagation
|
Apr 07, 2025 |
|
ACPBench Hard: Generative Planning Reasoning Tasks
|
Apr 06, 2025 |
|
Efficient Training of Large Language Models
|
Apr 05, 2025 |
|
Uni4D Dynamic 4D Modeling from Casual Video
|
Apr 04, 2025 |
|
KDTalker: Audio-Driven Talking Portraits via Implicit Keypoint Diffusion
|
Apr 03, 2025 |
|
OLMo 2: Fully Open Language Model Advancements
|
Apr 02, 2025 |
|
Stable-SCore Stable 3D Shape Correspondence via Registration
|
Apr 01, 2025 |
|
ProjectEval: Benchmarking Project-Level Code Generation by LLM Agents
|
Mar 31, 2025 |
|
Embodied Agent Confidence Elicitation in Dynamic Multimodal Environments
|
Mar 30, 2025 |
|
VLMs Playing StarCraft II: A Multimodal Decision Benchmark
|
Mar 29, 2025 |
|
M-Attack: Simple Yet Effective Attacks Against Strong Vision-Language Models
|
Mar 28, 2025 |
|
Deep Learning for Inverse Design of Radio-Frequency Circuits
|
Mar 27, 2025 |
|
Coding with LLMs A Developer's Guide by Simon Willison
|
Mar 26, 2025 |
|
Vision-R1 Reasoning in Multimodal Large Language Models via RL
|
Mar 25, 2025 |
|
OWL: Optimized Multi-Agent Assistance for Task Automation
|
Mar 24, 2025 |
|
Generalized Kullback-Leibler Divergence Loss for Enhanced Learning
|
Mar 23, 2025 |
|
Unsloth: A Practical Guide to LLM Fine-Tuning
|
Mar 22, 2025 |
|
Introducing the New PyTorch Landscape
|
Mar 21, 2025 |
|
Deep Learning for Inverse Design of Radio-Frequency Circuits
|
Mar 20, 2025 |
|
Distill Any Depth: Monocular Depth Estimation via Distillation
|
Mar 18, 2025 |
|
Economical Inference: DeepSeek's Multi-Head Latent Attention in LLMs
|
Mar 16, 2025 |
|
SWE-RL: Reinforcement Learning for LLMs on Software Evolution
|
Mar 15, 2025 |
|
Optimizing Quantum Circuit Mapping with SAT Solving at Amazon
|
Mar 14, 2025 |
|
LM Studio SDK: Python and TypeScript APIs for Local AI
|
Mar 13, 2025 |
|
GameFi AI Agents, DeFi, and Decentralized Virtual Ecosystems
|
Mar 12, 2025 |
|
LLMS Play Among Us
|
Mar 11, 2025 |
|
AN/UYK-1: Stored Logic Multiple-Purpose Digital Computer
|
Mar 10, 2025 |
|
Training Code Generation Models for Self-Debugging
|
Mar 09, 2025 |
|
LLMs in The Chameleon Game: Strategic Information Dynamics
|
Mar 09, 2025 |
|
GameFi: AI Agents, DeFi, and Decentralized Virtual Ecosystems
|
Mar 08, 2025 |
|
Training Code Generation Models for Self-Debugging
|
Mar 06, 2025 |
|
Accelerating Generative AI with PyTorch: Fast Inference with SAM2
|
Mar 04, 2025 |
|
V-HOP Visuo-Haptic 6D Object Pose Tracking
|
Mar 03, 2025 |
|
FACTR Force-Attending Curriculum Training for Contact-Rich Policy Learning
|
Mar 02, 2025 |
|
Language Model Training for Social Deduction in Among Us
|
Mar 01, 2025 |
|
Depth Pro Sharp Monocular Metric Depth Estimation
|
Feb 28, 2025 |
|
MME-CoT Benchmarking Chain-of-Thought in Large Multimodal Models
|
Feb 27, 2025 |
|
Unsloth Efficient GRPO for Long-Context Reasoning Models
|
Feb 26, 2025 |
|
CoT-Valve Tunable Length Control for Chain-of-Thought Reasoning
|
Feb 25, 2025 |
|
Implementing Transformers from Scratch
|
Feb 25, 2025 |
|
Reflection and Refraction
|
Feb 24, 2025 |
|
MixGCN Scalable Graph Convolutional Network Training
|
Feb 23, 2025 |
|
Open-Source AI The Imperative for Transparency
|
Feb 22, 2025 |
|
Forge Reasoning API and Nous Chat Advancing LLM Inference
|
Feb 21, 2025 |
|
Gradient Equilibrium in Online Learning
|
Feb 20, 2025 |
|
Encoder-Free 3D Large Multimodal Models An Investigation
|
Feb 19, 2025 |
|
Intel and PyTorch Empowering Generative AI
|
Feb 19, 2025 |
|
Iterative Prompting and LLM Code Optimization
|
Feb 18, 2025 |
|
Everything You Always Wanted To Know About Mathematics
|
Feb 17, 2025 |
|
The Instruct Monomyth_ Why Base Models Matter
|
Feb 16, 2025 |
|
DSJJJJ Desideratic AI and Mischievous Instability
|
Feb 15, 2025 |
|
Simplified PyTorch MLOps Workflow with Arm and GitHub
|
Feb 14, 2025 |
|
UMed-LVLM_ Unveiling Medical Abnormalities in Vision-Language Models
|
Feb 13, 2025 |
|
Ploppie_ A LiteLLM Abstraction Layer
|
Feb 12, 2025 |
|
Heat's Demise of Quantum Entanglement
|
Feb 11, 2025 |
|
Provably Autonomous AI Agents on Twitter
|
Feb 10, 2025 |
|
Confidence-Reward Driven Preference Optimization for Machine Translation
|
Feb 09, 2025 |
|
Exotic Smooth Four-Manifolds
|
Feb 08, 2025 |
|
Neuro-Symbolic AI A 2024 Systematic Review
|
Feb 07, 2025 |
|
YuLan-Mini A Data-Efficient Language Model
|
Feb 06, 2025 |
|
Jasper and Stella: Distilling State-of-the-Art Embedding Models
|
Feb 05, 2025 |
|
Creating a unique agent with ElizaOS
|
Feb 04, 2025 |
|
DeepSeek-V3 A 671B Parameter Mixture-of-Experts Language Model
|
Feb 03, 2025 |
|
Alice's Adventures in Differentiable Wonderland
|
Feb 02, 2025 |
|
Cline Development Assistant
|
Feb 01, 2025 |
|
Hyperbolic Time Chambers and Brain Emulation
|
Jan 31, 2025 |
|
Genesis A Universal Physics Engine for Robotics
|
Jan 30, 2025 |
|
Evolutionary & Market-Based Optimization
|
Jan 29, 2025 |
|
Benchmarking LLM Creativity and Diversity
|
Jan 28, 2025 |
|
Distilling GPT-4 for Wine Grape Variety Classification
|
Jan 27, 2025 |
|
Efficient Attention Mechanisms in Transformers
|
Jan 26, 2025 |
|
Byte Latent Transformer and Other AI Research at Meta
|
Jan 25, 2025 |
|
AI Agent Workflow and Deployment
|
Jan 24, 2025 |
|
Absolute Unit Neural Networks
|
Jan 23, 2025 |
|
LLMs and the Brain_ A Converging Architecture
|
Jan 22, 2025 |
|
Neuroevolution A Review
|
Jan 21, 2025 |
|
Building a High-Frequency Trading Exchange
|
Jan 20, 2025 |
|
The Unreasonable Effectiveness of Data and Scaling in AI
|
Jan 19, 2025 |
|
Patents and Interview: Inertial Mass Reduction in Craft
|
Jan 18, 2025 |
|
ChatGPT-4o in Financial Data Analysis
|
Jan 17, 2025 |
|
Exotic Smooth Four-Manifolds
|
Jan 16, 2025 |
|
Monolith_ A Real-Time Recommendation System
|
Jan 15, 2025 |
|
Automating Artificial Life Discovery with Foundation Models
|
Jan 14, 2025 |
|
Building Effective Agents with LLMs
|
Jan 13, 2025 |
|
Latent Reasoning in Large Language Models
|
Jan 12, 2025 |
|
LLM Multi-Step Reasoning_ Think-to-Talk or Talk-to-Think_
|
Jan 11, 2025 |
|
Neural Observation Field Guided Hybrid Camera Placement Optimization
|
Jan 10, 2025 |
|
Phi-4_ A 14B Parameter Language Model
|
Jan 10, 2025 |
|
Post-Hoc MOTS_ Time-Symmetric Multi-Object Tracking
|
Jan 09, 2025 |
|
Thompson Sampling Regret Bounds for Logistic Bandits
|
Jan 08, 2025 |
|
Bi-Level Optimization for Redundant Manipulator Trajectory Optimization
|
Jan 07, 2025 |
|
An end-to-end attention-based approach for learning on graphs
|
Jan 06, 2025 |
|
DMRA_ Diffusion Model with Representation Alignment for Protein Inverse Folding
|
Jan 05, 2025 |
|
Training Jacobians of Neural Networks
|
Jan 04, 2025 |
|
xAI's Colossus_ A Million-GPU Supercomputer
|
Jan 03, 2025 |
|
Situational Awareness_ The Coming Age of Superintelligence
|
Jan 02, 2025 |
|
The Return of Pseudoscience in AI
|
Jan 02, 2025 |
|
Surpassing OpenAI's O1_ Distillation and the Bitter Lesson
|
Jan 01, 2025 |
|
Rebooting the Arsenal of Democracy
|
Jan 01, 2025 |
|
QwQ_ Exploring AI Reasoning Capabilities
|
Dec 31, 2024 |
|
Parametric PerceptNet for Image Quality Assessment
|
Dec 30, 2024 |
|
Optimizing Mixed-Input Matrix Multiplication on NVIDIA Ampere
|
Dec 29, 2024 |
|
OpenAI's o1_ Reasoning with LLMs
|
Dec 28, 2024 |
|
O1 Replication_ Distillation, Progress, and Lessons
|
Dec 27, 2024 |
|
Moto_ A Latent Motion Token Language Model for Robot Manipulation
|
Dec 26, 2024 |
|
Nonlinear Unitary Photonic Circuits for Deep Learning
|
Dec 26, 2024 |
|
MAG-V_ A Multi-Agent Framework for Synthetic Data Generation and Verification
|
Dec 26, 2024 |
|
Machines of Loving Grace_ AI's Transformative Potential
|
Dec 25, 2024 |
|
Hybrid-SQuAD_ A Scholarly Question Answering Dataset
|
Dec 24, 2024 |
|
LearnLM_ A Google AI for Education
|
Dec 24, 2024 |
|
HunyuanVideo_ A Large Open-Source Video Generation Model
|
Dec 23, 2024 |
|
Fine-Tuning Mosquito Larvae Locomotion via Reinforcement Learning
|
Dec 22, 2024 |
|
Fine-Tuning LLMs with Ollama
|
Dec 21, 2024 |
|
FedDW_ Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning
|
Dec 20, 2024 |
|
Exphormer_ Scaling Transformers for Graph-Structured Data
|
Dec 20, 2024 |
|
DHCP_ Detecting Hallucinations in Large Vision-Language Models
|
Dec 19, 2024 |
|
Benchmarking 25 State-of-the-Art LLMs
|
Dec 18, 2024 |
|
Detecting AI-Generated Responses in Multiple-Choice Assessments
|
Dec 17, 2024 |
|
Avoiding Rookie Mistakes in Machine Learning
|
Dec 16, 2024 |
|
AI-Powered Ultrasound for Global Maternal Healthcare
|
Dec 16, 2024 |
|
DeMo_ Decoupled Momentum Optimization for Large Neural Networks
|
Dec 15, 2024 |
|
CS Freshmen and ChatGPT_ A Log Analysis
|
Dec 15, 2024 |
|
AI Compiler for Autonomous Vehicles
|
Dec 14, 2024 |
|
Competitive Programmer's Handbook
|
Dec 13, 2024 |
|
AI Coding Tool Showdown_ Cursor, Bolt, Replit, and V0 Compared
|
Dec 12, 2024 |
|
Challenges in Human-Agent Communication
|
Dec 11, 2024 |
|
ASL Fingerspelling Recognition Competition
|
Dec 10, 2024 |
|
Accelerating Mobile AI with ExecuTorch and KleidiAI
|
Dec 10, 2024 |