Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

OpenJarvis: Local-first AI agents that run entirely on-device

The framework addresses concerns over data privacy, latency, and subscription costs by making local inference the default and cloud APIs optional.

byAytun Çelebi
March 16, 2026
in Research
Home Research
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

Stanford University researchers have released OpenJarvis, an open-source framework designed for building personal artificial intelligence agents that operate entirely on-device.

The framework aims to reduce latency, recurring costs, and data exposure concerns associated with cloud-based AI solutions by prioritizing local execution. This approach positions local AI as a default, with cloud usage becoming an optional component.

OpenJarvis originates from Stanford’s Scaling Intelligence Lab. It functions as both a research platform and a deployment infrastructure for local-first AI systems.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

The project emphasizes the complete software stack needed for on-device agents, including usability, measurement, and long-term adaptability. The research cites prior work, “Intelligence Per Watt,” which found local language models could handle 88.7% of chat and reasoning queries at interactive latencies. Efficiency improved 5.3x between 2023 and 2025 according to the team.

OpenJarvis employs a “Five-Primitives” architecture: Intelligence, Engine, Agents, Tools & Memory, and Learning. These primitives function as composable abstractions for independent benchmarking and optimization.

The Intelligence primitive acts as the model layer, providing a unified catalog for various local model families. This abstraction allows developers to select models without manually tracking parameter counts or hardware fit.

The Engine primitive serves as the inference runtime, offering a common interface for backends such as Ollama, vLLM, SGLang, llama.cpp, and cloud APIs. It includes commands like “jarvis init” to detect hardware and recommend configurations, and “jarvis doctor” for maintenance.

The Agents primitive forms the behavior layer, translating model capabilities into structured actions under device constraints. It supports composable roles, including an Orchestrator for task breakdown and an Operative for personal workflows.

The Tools & Memory primitive constitutes the grounding layer. This includes support for MCP (Model Context Protocol) for tool use, Google A2A for agent-to-agent communication, and semantic indexing for local retrieval. It also connects local models to tools and persistent personal context.

The Learning primitive provides a closed-loop improvement mechanism. It uses local interaction traces to generate training data, refine agent behavior, and enhance model selection. Optimization occurs across model weights, LM prompts, agentic logic, and the inference engine.

OpenJarvis prioritizes efficiency, treating energy, FLOPs, latency, and cost as key constraints alongside task quality. It incorporates a hardware-agnostic telemetry system for profiling energy on NVIDIA GPUs, AMD GPUs, and Apple Silicon, with 50 ms sampling intervals. The “jarvis bench” command standardizes benchmarking for latency, throughput, and energy per query.

Developer interfaces for OpenJarvis include a browser application, a desktop application for macOS, Windows, and Linux, a Python SDK, and a command-line interface (CLI). All core functionality operates without a network connection.

The “jarvis serve” command starts a FastAPI server with SSE streaming, which the developers state can serve as a drop-in replacement for OpenAI clients. This feature is intended to lower the migration cost for developers prototyping with an API-shaped interface while maintaining local inference.


Featured image credit

Tags: openjarvis

Related Posts

Study links AI-assisted homework to lower exam scores

Study links AI-assisted homework to lower exam scores

June 22, 2026
Harvard and Boston Children’s use AI to revisit unsolved genetic cases

Harvard and Boston Children’s use AI to revisit unsolved genetic cases

June 19, 2026
Adobe report finds 86% of creators now use generative AI in workflows

Adobe report finds 86% of creators now use generative AI in workflows

June 17, 2026
AI transfer learning speeds cosmology research but has hidden risks

AI transfer learning speeds cosmology research but has hidden risks

June 15, 2026
Phishing scams targeting travelers hit record levels in 2026

Phishing scams targeting travelers hit record levels in 2026

June 15, 2026
Most UK SMEs now consult AI before their accountants

Most UK SMEs now consult AI before their accountants

June 12, 2026

LATEST NEWS

Samsung adopts ChatGPT Enterprise and Codex across global workforce

Samsung Galaxy S27 Pro leak points to built-in Privacy Display

Perseverance rover completes a marathon on Mars

Polymarket accused of paying creators to post misleading TikTok bet videos

OpenAI improves health responses for free ChatGPT users

Adobe expands Firefly AI across Premiere, Illustrator, InDesign and Frame.io

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Novoresume

PolyAI

SeaArt

H2O.ai

Techpresso

Namecheap Free Logo Maker

Binaural Beats Factory

Lyricallabs

Jobscan

Vsub

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.