
Ollama 0.15 adds `ollama launch` for 4 coding CLIs, plus 64K+ context tuning
Executive Summary
Ollama shipped v0.15 with ollama launch, a one-command wrapper to run Claude Code, Codex, Droid, and OpenCode against a selected backend; the pitch is reducing per-tool env-var/config glue while making “swap local vs cloud model” a single knob. The same drop spotlights GLM‑4.7 Flash memory optimization for 64k+ context runs and positions Ollama Cloud as the fallback for full precision/long context; early papercuts exist—one report shows ollama launch claude failing with “claude is not installed” despite the CLI working, suggesting PATH/detection brittleness.
• Codex subagents: /experimental enables orchestrator/worker spawning; workers appear pinned to gpt‑5.2‑codex while orchestrators inherit parent model choice—routing control may be limited; OpenAI says a high Responses API error rate was fixed, but no RCA.
• Cursor: subagents can pin or inherit models via .cursor/agents/<agent>.md; an unverified field claim says Salesforce has 20,000+ active users, ~90% adoption, and >30% PR velocity.
• Crawler tax: codebase.md reports ~771.5k crawler hits vs ~9.7k humans in 30 days; Amazonbot alone at ~408k/month even with Cloudflare blocks.
• MCP plumbing: Google Cloud proposes gRPC as a native MCP transport with bidirectional streaming, replacing JSON-RPC gateways—cleaner fit for enterprise meshes, but still a proposal.
Top links today
- CaMeLs security design for computer-use agents
- Physics-IQ benchmark for video model physics
- LLM self-detection fails for homework plagiarism
- ARC Prize 2025 technical report
- Language of thought increases LLM output diversity
- AI exposure plus worker adaptability analysis
- US plans investment in rare earth magnets supply
- TechCrunch AI lab monetization levels framework
- Pope Leo XIV message on AI and media
- TSMC supplier chain overview chart
- Ollama repository and local model runtime
Feature Spotlight
Clawdbot ops wave: always-on local agents, deployment patterns, and brittle sessions
Clawdbot is going viral as an always-on, local-first “AI employee.” Engineers are rapidly deploying it (Mac mini/VPS), then immediately hitting real ops problems: session corruption, prompt-injection risk, and isolation/sandboxing decisions.
🦞 Clawdbot ops wave: always-on local agents, deployment patterns, and brittle sessions
Today’s dominant builder story is the Clawdbot adoption spike: people wiring it into Slack/Gmail/Asana/Telegram, debating dedicated hardware vs VPS, and learning failure modes (session corruption, tool-call brittleness). Excludes specific Clawdbot skills/plugins (covered under Plugins/Skills).
Clawdbot brittleness: corrupted tool calls can break sessions, and /new is the recovery ritual
Clawdbot (clawdbot): Multiple users hit the same failure mode: malformed/corrupted tool calls “poison” a running session, sometimes forcing deletion of session history; the common workaround is starting a fresh thread with /new, as documented in Session broke out of house and in the Tips and tricks thread. This matters because it’s not a model-quality issue—it’s an orchestration/serialization fragility that shows up only once you run long-lived agents.
• Observed symptoms: Reports range from “can’t talk to it while away” in Session broke out of house to repeated “corrupt tool calls” complaints in Corrupted tool calls.
• Operational workaround: The /new guidance is repeated explicitly in Tips and tricks thread and again as a concrete fix in Start a new session.
Clawdbot setup wave: wiring Slack/Telegram/Gmail/Cal/Asana/HubSpot/Obsidian into one agent
Clawdbot (clawdbot): A visible adoption spike is coming from people connecting Clawdbot to their daily comms + work stack (Slack/Telegram/Gmail/Calendar/Asana/HubSpot/Obsidian) and then asking what “first week” use cases to try once it’s wired in, as described in Integration list and Beginner use cases ask. This matters because it’s the operational leap from “coding assistant” to “always-on operator” where reliability and permissions start dominating the experience.
• Real-world posture: Users are explicitly planning to use Clawdbot to respond to social replies (with disclosure) in Reply automation plan, which is a good proxy for how quickly people move from local automation to public-facing actions.
• What it looks like: The Clawdbot Gateway UI example in Dashboard screenshot shows the “tool did X; want me to publish/delete?” interaction loop that tends to define agent ops more than model quality.
Isolation debate: running Clawdbot on your main machine vs dedicated box/VPS to limit blast radius
Clawdbot (clawdbot): A recurring argument is that “credentials are already granted, so why isolate?” but the counterpoint is about blast radius: an agent that can run commands, edit files, and act in email can be steered by prompt injection or bugs, so isolating it on a separate machine/VPS is an ops safety boundary, as debated in Main machine question and stated bluntly in Prompt injection risk. This matters because Clawdbot’s value comes from deep integration, and that same integration defines the failure mode.
• What the tool itself warns: The onboarding flow in Security prompt screenshot explicitly frames agents as “powerful and inherently risky,” which matches the community’s push for least privilege.
• Concrete concern: The “reply as you in emails, delete files” list in Prompt injection risk is the practical threat model people are using to justify a dedicated host.
Clawdbot hybrid inference: controlling LM Studio remotely and swapping in local models for easy tasks
Clawdbot (clawdbot): Users are running a hybrid routing setup where Clawdbot is controlled via chat (Telegram) while it manages LM Studio on a host machine—downloading a local model, testing it, then using it for “easy tasks” to reduce cloud usage, as shown in LM Studio control screenshot and described in Mixing local and cloud. This matters because “local for cheap tasks + hosted for hard tasks” is quickly becoming the default ops pattern for always-on agents.
• What’s actually happening: The chat log in LM Studio control screenshot shows the agent pulling a large model, measuring task behavior (speed/verbosity), and then “swapping out” the default—basically model routing as a background maintenance job.
• User framing: The “combination of models, including local models for easier tasks” phrasing in Mixing local and cloud captures the emerging mental model of Clawdbot as a coordinator, not a single model.
Clawdbot Node deployment: VPS-first setup as the alternative to extra machines
Clawdbot (clawdbot): A practical deployment pattern getting repeated is “don’t buy another computer; deploy a VPS and run Clawdbot there,” with docs and lightweight guides getting shared in VPS instead of machine and AWS setup guide. The point is operational: always-on uptime, clearer blast radius, and fewer surprises when your laptop sleeps.
• Mechanics: The Clawdbot “Node” workflow is documented in the Node docs, which frames it as a CLI-managed deployment rather than a desktop-only install.
• Cost narrative: The Amazon Free Tier angle is being explicitly used to counter the hardware-buying wave in Sponsor not Mac mini.
Mac mini buying wave for Clawdbot meets pushback: old hardware or AWS Free Tier works
Clawdbot (clawdbot): People are still buying Mac minis specifically to run Clawdbot 24/7, but the pushback got louder today: it doesn’t “require” dedicated Apple hardware, and the social default is drifting toward “use whatever box you have” or a cheap VPS instead, as argued in Sponsor not Mac mini and reinforced by the PSA repost in No Mac mini needed. This matters because perceived hardware requirements shape adoption and security posture (always-on machine vs laptop vs hosted).
• Community split: You can see the “Mac mini ordered” impulse in Mac mini ordered, while the meta-question “why is everyone ordering a Mac mini?” is in Trend confusion.
• Alternative framing: The strongest counter-position is “sponsor contributors, deploy on Free Tier,” paired with the repo’s community sponsorship section in the GitHub repo.
AI crawler ops pain: codebase.md reports 771k crawler hits vs 9.7k humans in 30 days
AI crawler load (ops signal): A concrete ops datapoint: codebase.md reports ~771.5k crawler requests in 30 days vs ~9.7k human visits and ~14k “agents,” with Amazonbot singled out at ~408k requests/month despite Cloudflare blocks, as shown in Traffic breakdown. This matters to anyone shipping agent-facing docs/tools because crawler load is becoming a first-order reliability and cost issue.
• Where it shows up: The screenshot in Traffic breakdown breaks traffic into humans/agents/bots, which is the kind of measurement Clawdbot-scale OSS projects end up needing once they go viral.
• Why it’s sticky: This isn’t “marketing traffic.” It’s operational overhead driven by automated retrieval.
Clawdbot privacy reality check: state is local, but providers and chat platforms still see content
Clawdbot (clawdbot): A FAQ-style explanation is circulating that Clawdbot’s state/memory/workspace is local by default, but anything sent to model APIs and chat channels is still processed/stored by those external services, as summarized in Local vs remote data answer. This matters because a lot of “local-first” expectations break once you wire in Gmail/Slack/Telegram.
• Operational implication: The answer in Local vs remote data answer draws a clean boundary: local storage for agent state vs unavoidable third-party exposure for channels and model inference, with pointers to the FAQ page.
Trust calibration warning: new users are over-believing Clawdbot outputs
Clawdbot (clawdbot): As new users flood in, there’s a parallel warning that people are taking agent responses as truth too readily, as called out in Trust warning. This matters operationally because always-on agents are most dangerous when their outputs become “accepted defaults” without review.
The same line of thinking shows up indirectly when users downplay worst-case recovery (“full reset… nbd”) in Main machine question, which is a mismatch with how integrated agents can affect external systems.
Files-over-apps for agent runners: reducing brittleness by keeping workflows as files
Files-over-apps (workflow pattern): A small but consistent practice is showing up: keep your agent setup minimal and file-based (Markdown rules, saved commands/skills as files) rather than wiring a large pile of external services, with the clearest statements in Files over apps and the “MCP = context rot” framing in MCP context rot claim. This matters for Clawdbot-style always-on ops because fewer moving parts usually means fewer long-session failures.
The trade-off is obvious: fewer integrations means less automation reach. That’s the tension these posts are describing.
🧠 Codex agent updates: subagents, reliability fixes, and model-routing quirks
Continues Codex momentum from earlier this week, but today’s delta is about how Codex spawns subagents and what models they’re forced to use, plus service reliability notes affecting long runs. Excludes Claude/Excel and Clawdbot hype (covered elsewhere).
Codex adds subagents behind /experimental, with worker vs orchestrator roles
Codex (OpenAI): Codex now supports spawning subagents—an orchestrator that inherits your main model and a worker that does a single task—enabled via the /experimental flag per the Subagents flag note.
This changes how long runs are structured: instead of one agent juggling planning + execution, Codex can split work into multiple short-lived workers under one coordinating loop.
Codex subagent workers appear hardcoded to gpt-5.2-codex
Codex subagents (OpenAI): Early reverse-engineering suggests worker subagents are forced to gpt-5.2-codex, while orchestrator subagents inherit the parent model choice, as shown in the Subagent model diagram and discussed alongside the /experimental enablement in the Subagents flag note.
This matters if you were hoping to route cheap/fast models into “worker” roles; it looks like model-routing control may currently be limited to the top-level session and orchestrators.
OpenAI says Responses API error rate is fixed (Codex should be stable)
Responses API (OpenAI): OpenAI staff report they “fixed a high error rate” in the Responses API that Codex uses, and say service should be stable now, asking users to confirm if issues persist in the Stability note.
For teams running long agent sessions, this is the kind of silent reliability regression that shows up as flaky tool calls and stuck loops, so it’s a meaningful ops datapoint even without a detailed RCA.
Builders report a bad day for GPT-5.2 Codex on real coding tasks
GPT-5.2 Codex (OpenAI): A builder report says “5.2 codex isn't running well today,” citing incorrect Next.js sitemap code even with docs provided, unrelated edits to content, and slow copy-over of Tailwind typography; they switched to Opus 4.5, which “fixed it in one shot,” per the Regression report.
Treat it as anecdotal, but it’s a concrete reminder that day-to-day agent quality is a moving target—especially when you depend on tool calls and long-horizon edits.
Codex review mode will fetch upstream docs by itself, but GitHub auth can block it
Codex (OpenAI): A practical review workflow is emerging where Codex, when asked to review behavior, will proactively locate upstream repos/docs and try to read them—see the Upstream fetch attempt, where it pulls a raw GitHub doc and then hits a GitHub Search API 401 without auth.
This is useful because it reduces “go find the docs” back-and-forth, but it also means your agent runner needs a clear policy for when (and how) it’s allowed to authenticate to third-party developer APIs.
Codex users are explicitly asking for Cowork-style UX and team workflows
Codex (OpenAI): With “multiple launches” teased recently, developers are now being explicit about what they want next: a Cowork-like interface, better terminal messaging, plugins, team mode, plan mode, and subagent tooling parity, as laid out in the Feature wishlist.
It’s a product signal: the baseline expectation is shifting from “good model in a CLI” to “collaboration surface + durable workflows.”
Demand grows for a web-resumable, agents-first remote dev environment
Agent runtime UX: There’s a clear pain point around running agent sessions on a remote Linux box and resuming them via web/mobile without terminal emulation; one request asks for something “better than cursor cloud/claude web,” but still able to “work locally” and pick up the same agentic session later, as described in the Remote agent env ask and clarified in the Web resume requirement.
This sits right between “SSH to a server” and “hosted agent IDE,” and it’s becoming a recurring infrastructure product gap as more teams run long-lived agents.
🧩 Cursor subagents: model selection, defaults, and task-specific routing
New Cursor discussion today centers on subagent configuration mechanics—how you pin/inherit models per agent and what built-in subagents ship by default. This is distinct from broader agent-runner ops (covered under Clawdbot/ops).
Cursor subagents can pin or inherit models via .cursor/agents/<agent>.md
Cursor (Subagents config): Cursor subagents can be configured per-agent in .cursor/agents/<mysubagent>.md, where model: inherit keeps the parent model while model: <model> pins a specific one, as shown in a minimal front-matter snippet shared in config example.
This makes subagent routing explicit and reviewable in-repo (diffable), instead of hidden in UI toggles—useful when teams want deterministic behavior for “fast vs careful” tasks across long-running agent sessions.
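A minimal sketch of those mechanics; the agent name, prompt body, and any fields beyond `model` are illustrative assumptions, not Cursor's documented schema:

```markdown
---
# .cursor/agents/reviewer.md — hypothetical agent file
model: inherit        # keep the parent session's model; or pin one, e.g. `model: gpt-5.2-codex`
---
Review the current diff for correctness, missing tests, and unintended edits.
```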
Cursor reportedly hits ~90% adoption inside Salesforce with 20k+ active users
Cursor (Enterprise adoption signal): A field claim says Salesforce has 20,000+ active Cursor users, roughly 90% internal adoption, and a >30% increase in PR velocity, framed as organic word-of-mouth spread inside engineering, per Salesforce metrics.
If accurate, this is a strong datapoint that “agent IDE” tooling is crossing into default dev workflow territory at large enterprises, not just individual power users.
Cursor ships built-in Explore and General-purpose subagents with name invocation
Cursor (Built-in subagents): Cursor’s subagent feature includes default agents like Explore and General purpose, and supports direct invocation via prompts like “use explore subagent,” as illustrated in the workflow screenshot in built-in subagent UI.
• Parallel exploration: The same example shows Cursor spinning up multiple Explore agents “in parallel” for architecture + code-smell scanning, then tracking each agent’s current activity inline, as shown in built-in subagent UI.
This is a concrete shift from “one chat does everything” to named roles that can be re-used and taught as stable primitives.
Cursor subagents: task-specific model routing across providers
Cursor (Task routing pattern): A common setup emerging is to assign different models to different subagents—e.g., using GPT‑5.2 Codex for longer-running work, Opus 4.5 for implementation, and Composer‑1 for small, well-scoped tasks, as described in model mix example.
The main engineering implication is operational: the “default model” stops being a single decision, and becomes an orchestration choice per task type (latency/price vs correctness vs verbosity).
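A hedged illustration of that routing pattern, reusing the front-matter mechanism above; file names and the exact model identifier strings Cursor accepts are assumptions based on the models named in the post:

```text
# Three hypothetical agent files, one pinned model each:
.cursor/agents/long-runner.md   ->  model: gpt-5.2-codex   # longer-running, multi-step work
.cursor/agents/implementer.md   ->  model: opus-4.5        # main implementation passes
.cursor/agents/quick-fix.md     ->  model: composer-1      # small, well-scoped tasks
```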
🖥️ Local inference & self-hosting: Ollama ‘launch’, memory tuning, and local toolchains
Self-hosting chatter is very concrete today: Ollama adds a single command to run popular coding tools against local/cloud models, plus memory optimizations for long-context. Excludes agent-runner ops decisions (feature category).
Ollama 0.15 adds `ollama launch` to run Claude Code/Codex/Droid/OpenCode on Ollama models
Ollama (0.15): Ollama shipped a new ollama launch command that bootstraps popular coding agent CLIs (Claude Code, Codex, Droid, OpenCode) against an Ollama-selected model—aiming to remove the usual “wire up env vars/config files” friction called out in the Release note and detailed in the Launch blog.

• What changes for local toolchains: the launcher is positioned as a single entry point for picking a model (local or cloud) and then starting the tool with that backend, as shown in the Release note and the Coding quickstart.
• Interoperability signal: the same surface explicitly targets multiple agent CLIs (not just Ollama’s own chat), which makes “swap the backend model” a workflow knob rather than a per-tool setup project, per the Release note.
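A hedged sketch of the intended flow, stitched together from commands quoted elsewhere in this issue (the model names, the `codex` subcommand, and the `:cloud` tag are assumptions drawn from those examples, not a verified quickstart):

```bash
# pull a local model, then start Claude Code against it via the launcher
ollama pull glm-4.7-flash
ollama launch claude --model glm-4.7-flash

# the same surface is meant to cover the other CLIs, e.g.
# ollama launch codex --model glm-4.7-flash    # flag shape assumed to match the claude example

# flipping to hosted capacity keeps the toolchain, per the cloud examples:
# ollama pull glm-4.7:cloud && ollama launch claude --model glm-4.7:cloud
```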
Ollama tunes GLM-4.7 Flash for lower-memory 64k+ context sessions
GLM‑4.7 Flash (Ollama): Ollama says GLM‑4.7 Flash was optimized to use “much less memory” for long context lengths (64k+), which matters most for long-running local coding sessions that previously forced a hardware upgrade or aggressive context trimming, according to the Memory optimization note.

This is framed as a practical enabler for running longer-context workloads locally (or on smaller machines) without immediately jumping to a cloud endpoint, per the model guidance in the Launch blog.
Qwen3-TTS roadmap: vLLM streaming inference and an upcoming 25Hz control release
Qwen3‑TTS (Alibaba Qwen): The Qwen team says streaming inference support is being built in collaboration with vLLM, and they outline a near-term path to more controllable output—suggesting an upcoming open-source “25Hz model” to enable stronger instruct-style controls like emotion/style, per the Qwen3-TTS update.
• Voice consistency: they recommend using Voice Design to pick a voice and then the Base model’s cloning as a reference anchor for stable tone, as described in the Qwen3-TTS update.
• Streaming interest from builders: community reaction specifically calls out the significance of vLLM-level streaming audio support, as seen in the Streaming audio question.
Ollama Cloud positions GLM-4.7 as a fallback for full-precision long context
Ollama Cloud (GLM‑4.7): Alongside the local-memory work, Ollama points to its cloud offering for GLM‑4.7 “full precision and context length” when local hardware isn’t enough, as described in the Cloud option note.

The implication for teams is a single toolchain that can flip between local and hosted capacity while keeping the same CLI surfaces (via ollama pull …:cloud and ollama launch …), following the examples in the Launch blog.
Bug report: `ollama launch claude` mis-detects Claude Code installation
Ollama launch (Claude Code integration): A user reports ollama launch claude --model glm-4.7-flash fails with “claude is not installed” even though claude -p 'say hi' runs successfully in the same shell, pointing to a detection/PATH resolution bug in the launcher, as shown in the Terminal error screenshot.
If reproducible across environments, this is the kind of papercut that blocks adoption of “one-command” local toolchains until the launcher’s install checks match how Claude Code is actually discovered on developer machines.
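If you hit the same error, a quick sanity check of how your shell resolves the binary (versus however the launcher probes for it, which the report doesn't show) can confirm whether it's a PATH issue:

```bash
command -v claude     # where the working `claude -p 'say hi'` resolves from
echo "$PATH"          # is that directory visible to the environment Ollama runs in?
```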
📊 Claude in Excel: spreadsheet agents become “real work”
Claude-in-Excel is the standout workplace integration today: engineers and analysts report major productivity wins and capabilities (like complex table segmentation) that other spreadsheet copilots struggle with. Excludes broader Anthropic strategy/business (covered under Funding & Enterprise).
Finance and corporate teams report big productivity jumps from Claude in Excel
Claude in Excel (Anthropic/Microsoft): Following up on Pro rollout (Pro availability and the initial shipping surface), new field reports frame it as a “normie” adoption moment: one user claims a project that “would have taken 2 weeks” was finished “in 30 mins” for $20/mo in the productivity claim, while another reports finance-industry word-of-mouth that it’s “blowing their minds” in the finance anecdote screenshot.
The signal here isn’t model specs; it’s that spreadsheets are the default interface for huge parts of the economy, so an agent that reliably manipulates real Excel sheets changes who can automate work without learning a new toolchain.
Claude in Excel shows one-shot messy-sheet table segmentation via an agent harness
Claude in Excel (Anthropic/Microsoft): A concrete capability report is “one-shot table segmentation” on messy spreadsheets—splitting a composite sheet into multiple clean 2D tables and writing them into new sheets—called out as unusually strong compared to other Excel agents in the segmentation walkthrough.
• What the agent is doing: it reads a range (example shown as A1:R39), infers table boundaries + metadata rows, creates new sheets, and rewrites segmented data, as shown in the segmentation walkthrough.
• Cost/latency tradeoff: the same report notes it can be “quite a bit of time, tokens, and prompting” when you try to codify repetitive workflows at scale, which matters if you’re building headless/batch Excel parsing pipelines rather than interactive analysis in the UI, as described in the segmentation walkthrough.
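For intuition only, here is what the simplest version of that segmentation looks like as a script: splitting on blank rows with openpyxl and writing each block to its own sheet. The actual agent infers boundaries and metadata rows with a model rather than a heuristic, so this is a rough stand-in, not Anthropic's implementation:

```python
from openpyxl import load_workbook

wb = load_workbook("messy.xlsx")
src = wb.active

# group contiguous non-empty rows into blocks, using blank rows as separators
blocks, current = [], []
for row in src.iter_rows(min_row=1, max_row=src.max_row, values_only=True):
    if any(cell is not None for cell in row):
        current.append(row)
    elif current:
        blocks.append(current)
        current = []
if current:
    blocks.append(current)

# write each block to its own clean 2D sheet
for i, block in enumerate(blocks, start=1):
    out = wb.create_sheet(f"table_{i}")
    for row in block:
        out.append(list(row))

wb.save("segmented.xlsx")
```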
Spreadsheet copilot gap: Claude-in-Excel/Sheets vs Gemini-in-Sheets becomes a loud complaint
Spreadsheet agents (Anthropic vs Google): A recurring comparison thread argues Claude’s spreadsheet manipulation is meaningfully ahead of Gemini-in-Sheets—citing “16M impressions in 24 hours” as a proxy for how widely the Excel/Sheets experience is being discussed, and claiming Anthropic is “0.5 to 3 years ahead” on formula/sheet handling in the integration gap claim.
This is still anecdotal (no shared eval harness), but it’s a clear product integration signal: spreadsheet reliability is showing up as a competitive axis, not just model quality.
🧰 Workflow patterns: context hygiene, verification loops, and “files over apps”
Practitioner guidance today is about reliability: context rot, prompt discipline, and multi-model workflows (creative vs verifier). This category excludes tool-specific ops issues for Clawdbot (feature) and product-level changelogs (other categories).
“Files over apps” becomes a durability tactic for long agent sessions
Files over apps (workflow pattern): A recurring practitioner approach is to treat your agent setup as a filesystem artifact—keep commands, subagents, and Skills as plain files (often Markdown), and avoid bolting on lots of external services to reduce brittleness, as described in Files over apps note.
A sharper critique bundled into the same idea is that “MCP = context rot,” with the claim that long-running work is more stable when mediated through a CLI + files rather than ever-growing tool/context surfaces, as argued in MCP equals context rot.
The practical implication is that context management becomes reviewable (diffable) because the “operating manual” lives in repo files, not hidden in UI state.
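As a concrete, purely illustrative picture of what “the operating manual lives in repo files” can look like (directory and file names are assumptions, not any specific tool's convention):

```text
agents/
  rules.md          # standing instructions, versioned and diffable like code
  commands/
    release.md      # a saved command/prompt invoked by name
  skills/
    changelog.md    # a reusable skill kept as a plain Markdown file
```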
A two-model workflow: creative driver plus correctness verifier
Multi-model pairing (workflow pattern): Builders are explicitly splitting roles across models—one report frames it as needing both Opus 4.5 for fast creative ideation and GPT-5.2 Codex for technical rigor, including catching issues the creative model misses, as stated in Two-model pairing.
There’s also the negative-space detail: the “verifier” role isn’t optional when you’re seeing instruction-following drift or silent deviations, which shows up in a complaint that Codex found “astounding” bugs left behind after Claude didn’t follow instructions, per the Codex finds leftovers report.
Another field datapoint is that model roles can flip depending on task shape—one developer reports swapping away from Codex to Opus 4.5 after Codex produced wrong/irrelevant edits on concrete tasks, as described in Switched to Opus.
Overtrust in agent outputs is emerging as an operational failure mode
Trust calibration (workflow risk): A maintainer observation is that new users often take agent output at face value, and that this credulity is “troubling,” per the Overtrust warning—a reminder that review/verification habits are part of the workflow, not a nice-to-have.
A compatible “tools require skill” framing shows up in a separate reflection that coding with agents can be both highly productive and “downright destructive,” reinforcing that reliability work is process, not model choice, as described in Tool must be wielded.
Repeatable “fresh eyes” self-review loop for agent-written code
Self-review loop (workflow pattern): Following up on Review prompt (second-pass bug hunting), a concrete variation is to have the model re-read only what it just changed with “fresh eyes,” then repeat the exact same instruction 3–5 times—a practitioner reports doing this “hundreds of times a day” to reliably surface obvious mistakes, per the Fresh eyes prompt.
The core mechanic is that each pass reframes the work as verification rather than generation, so the model is less tempted to continue building and more likely to notice mismatches, missing imports, edge cases, and unintended edits (especially in multi-file refactors).
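A minimal sketch of that loop as orchestration code, assuming a generic `run_agent(prompt)` call into whatever coding agent you drive; the function and prompt wording are illustrative, not a specific tool's API:

```python
FRESH_EYES = (
    "With fresh eyes, re-read only the changes you just made. "
    "Look for obvious mistakes: missing imports, edge cases, unintended edits. "
    "Fix anything you find and summarize what changed."
)

def fresh_eyes_review(run_agent, passes: int = 4) -> list[str]:
    """Send the identical verification instruction several times in a row."""
    reports = []
    for _ in range(passes):          # practitioners report 3-5 passes
        reports.append(run_agent(FRESH_EYES))
    return reports
```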
“Vibe coding” gets a sharper definition (and a boundary)
Vibe coding (workflow boundary): One crisp definition separates “vibe coding” from professional engineering: it’s throwing together a non-production project in a stack you barely know, made feasible by AI; professional teams still use AI, but the constraints and incentives differ, as argued in Vibe coding definition.
This framing matters because it names a failure mode teams keep seeing: demos scale faster than the reliability/maintainability work needed to ship and operate systems.
Agent-first engineering: lower barrier, higher throughput ceiling
Agent-first software engineering (capability shift): A short but widely shared claim is that agent-first tooling simultaneously raises the “floor” (more people can build) and the “ceiling” (experts can produce more), as framed in Floor and ceiling claim.
This is a macro signal more than a technique, but it’s shaping how teams think about skill distribution, leverage, and what “senior” work becomes when execution is cheap.
Why do all assistants sound the same? Personality convergence debate returns
LLM personality convergence (ecosystem discussion): There’s renewed frustration that many assistants converge to a similar “emoji/markdown” persona and “no human speaks like this,” with speculation that this is the reward-maximizing basin of RLHF/post-training, per the Personality convergence complaint.
For builders, the subtext is product/design: if your agent UX depends on tone, you may need explicit style constraints or post-processing rather than trusting defaults.
🧱 Plugins/Skills: orchestrators, skill marketplaces, and hardening packages
Today’s installable ecosystem news is about extending agent behavior: new orchestrator releases, skill directories, and security hardening scripts. Excludes MCP transport/protocol work (covered under Orchestration & MCP).
ACIP ships a Clawdbot integration for prompt-injection hardening
ACIP (Clawdbot hardening): A Clawdbot-specific integration of ACIP was posted with a “one-liner” installer aimed at making Clawdbot more resistant to prompt injection attacks, as described in the hardening note and implemented in the GitHub integration.
The pitch is straightforward: treat agents with filesystem/tool access as inherently risky, then add a guard layer early—setup and self-test flags are shown in the installer one-liner.
Kilo for Slack: bug report thread to PR without leaving Slack
Kilo for Slack (KiloCode): A Slack-native workflow was described where a bug report happens in a thread, someone tags @Kilo to implement, and a PR shows up back in Slack—framed as “the conversation is the spec,” according to the workflow description.
This is notable less for “Slack integration” and more because it treats Slack as the durable input artifact for an agent run (spec + context), which is where a lot of teams already do triage and decision-making anyway.
oh-my-opencode v3.0.0: production-ready orchestrator with dynamic agents and plan-to-code compilation
oh-my-opencode v3.0.0 (OpenCode ecosystem): The orchestrator project cut a major release, positioning it as “stable and production-ready” and adding dynamic agent creation plus two named components—Prometheus (a plan agent) and Atlas (an orchestrator that “compiles” work plans into code)—as described in the release notes.
• Agent generation: The update adds model categories + a skill system for creating agents on demand, per the release notes.
• Plan execution pipeline: Prometheus and Atlas formalize a plan-first workflow (plan output treated as an artifact that gets translated into code), as outlined in the release notes.
The post doesn’t include evals or latency/cost numbers yet, so the practical delta will show up in repo-level demos and failure cases over the next few days.
“Better call codex” skill adds cross-model Codex subagent spawning
“Better call codex” (Skill): A new skill was shared that spawns Codex subagents on demand—intended for workflows where you’re driving from Claude or another model but want Codex to take specific tasks, according to the skill announcement.
This lines up with the newly discussed Codex subagent split (orchestrator vs worker) that can be enabled with /experimental, as described in the subagents note, but the skill makes it callable as an extension rather than a manual workflow.
Context7 launches a Skill Marketplace for agent extensions
Context7 (Skill Marketplace): Context7 was reported to have launched a Skill Marketplace—positioned as a curated, “high signal” directory for extending agent capabilities—per the marketplace announcement.
Details like packaging format, install mechanism, and review/moderation model weren’t included in the tweets, so it’s not yet clear whether this is closer to a registry (metadata + installs) or a content hub (docs + prompts + snippets).
Open-source presentation agent turns context files + a template into PPT/PDF
Presentation agent (LlamaIndex): An open-source “vibe-coded” agent app was shared that generates a slide deck from uploaded context files plus a style template, then supports inline edits and exports to PowerPoint/PDF, as shown in the product demo.

• Stack: Built with Claude Code and the Claude Agent SDK, plus LlamaParse, per the product demo.
• Artifacts: App + repo are linked in the app link and GitHub repo.
It’s a concrete example of “agent as document compiler” where the artifact (deck) is the output, not a chat transcript.
🏗️ Agent frameworks & memory layers: from tracing to “memory OS”
Framework-level progress today emphasizes productionization: memory systems that are inspectable/editable and real-world tracing/eval workflows. Excludes single-product agent runners (feature) and pure research papers (covered under Research).
MemOS pushes agent memory toward an inspectable “memory OS” layer
MemOS (MemTensor): MemTensor’s MemOS is being positioned as an OS-level memory layer for LLM agents—structured, editable, and inspectable (not “just embeddings”)—with a unified API for add/retrieve/edit/delete and support for multimodal memory (text/images/tool traces/personas), as outlined in the project screenshot and the linked GitHub repo.
• Memory as a graph you can edit: The pitch is explicitly “inspectable, graph-based memory” plus a feedback/correction loop to refine what gets stored, as summarized in the project screenshot.
• Production-minded API surface: It calls out async memory ops and “low latency for production workloads,” which is the kind of claim teams will want to validate early against their existing RAG/memory stack, per the project screenshot.
LangSmith case study: making an AI SDR chatbot shippable with traces and evals
LangSmith (LangChain): A community write-up describes deploying an AI SDR chatbot (lead qualification + product Q&A + routing) and credits LangSmith traces, feedback, and evaluations for making the system debuggable enough to ship, with the full narrative in the LangSmith post and the linked case study write-up.
• State machine over “one big prompt”: The system is modeled as a LangGraph state machine (tool calls + retries + idempotency), which is how the team handled real-user multi-part questions and tool failures, per the LangSmith post and the case study write-up.
• Observed failure modes were operational, not theoretical: The post calls out issues like occasional duplicate downstream updates and “quality drift” once real traffic hit, and frames tracing/evals as the mechanism to isolate and fix those, as described in the case study write-up.
🔌 Orchestration & MCP: transport upgrades and interactive agent UIs
MCP-related items are narrower today but concrete: protocol plumbing (gRPC transport) and UI surfaces for agent mini-apps. Excludes generic skills/plugins (covered under Plugins/Skills).
Google Cloud proposes gRPC as a native transport for Model Context Protocol
Model Context Protocol (Google Cloud): Google Cloud published a technical proposal to add gRPC as a first-class transport for MCP—aiming to remove JSON-RPC transcoding gateways, enable bidirectional streaming, and fit existing enterprise gRPC service meshes, as summarized in the transport note and detailed in the gRPC transport blog.
The practical implication is cleaner “agent ↔ tool” plumbing for teams already standardized on gRPC, since MCP servers wouldn’t need a translation layer just to speak JSON-RPC.
CopilotKit passes 28,000 GitHub stars as it pushes MCP Apps and AG-UI
CopilotKit: CopilotKit crossed 28,000 GitHub stars and positioned 2025 as its breakout year as an “application layer” for agentic systems, with continued emphasis on AG-UI and chat-embedded mini-apps following up on mini-app demo—see the milestone post in the stars announcement and the repo link in the GitHub repo.

This is a distribution signal more than a spec update, but it correlates with ongoing standardization pressure around “agents that return UI,” not just text.
🏭 Infra signals: GPU demand shocks, packaging capacity, and AI power politics
Infra chatter is tied directly to AI demand: GPU rental pricing spikes, packaging capacity scaling, and power generation becoming part of the AI stack. Excludes funding rounds (covered under Funding & Enterprise).
H100 rental prices jump, attributed to Claude Opus 4.5 demand shock
H100 rental market: A charted H100 rental index shows a sharp upswing from early December into late January, with commentary attributing the move to a sudden wave of demand after Claude Opus 4.5—the key claim being that the pressure is concentrated on H100 rather than spilling broadly across other GPU SKUs, as described in the H100 rental index chart.
• Why this matters operationally: if the price shock is real and SKU-specific, it changes near-term capacity planning (H100 queues/spot pricing) while leaving A100/B200 as less-stressed fallback options—though the tweets don’t include a comparable A100/B200 time series, only the narrative claim in the H100 rental index chart.
Goldman projects TSMC CoWoS capacity to reach ~2,310k wafers by 2027
TSMC CoWoS (Goldman Sachs): A new sell-side chart claims TSMC’s CoWoS advanced-packaging capacity forecast was revised upward, showing 675k wafers (2025E), 1,275k (2026E), and 2,310k (2027E)—with the 2027 estimate notably above the prior 1,740k, as shown in the CoWoS capacity chart.
This is one of the cleaner “bottleneck signals” for AI accelerators because CoWoS gates HBM+die integration even when front-end wafer supply is available.
David Sacks relays Trump: AI companies should become “power companies”
Data center energy (US policy signal): David Sacks relays a Trump quote encouraging AI companies to “become power companies” and stand up their own generation alongside new data centers—explicitly framing self-supply of electricity as a competitive requirement, as stated in the Sacks clip.

This aligns with the broader shift where power procurement and generation show up as first-class constraints alongside chips and networking.
FT: US plans $1.6B investment in USA Rare Earth to secure AI-chip magnets
USA Rare Earth (US government): The FT-reported plan is a $1.6B investment into USA Rare Earth Inc, framed as shoring up “critical minerals and magnets required for AI chips,” with details of roughly a 10% stake plus $1.3B in senior secured debt via a Commerce Department program tied to the CHIPS Act, per the FT headline screenshot.
The immediate relevance for AI infra buyers is supply-chain risk moving upstream: magnets/minerals become explicit policy targets rather than background commodities.
codebase.md sees 771k crawler hits in 30 days, swamping human traffic
Crawler load as infra tax: Traffic stats for codebase.md show ~9.7k human visitors, ~14k “agents”, and ~771.5k crawlers over ~30 days, with Amazonbot called out for ~408k requests/month even with Cloudflare blocks, according to the traffic breakdown and the linked Codebase site.
For teams running documentation, demos, or model endpoints, this is another concrete datapoint that bot traffic is no longer edge noise—it can dominate baseline ops.
📦 Model releases & upgrades: roleplay LMs, TTS roadmap, and China frontier churn
Model news today is a mix of new endpoints for roleplay and continued China-model iteration (TTS and ERNIE). Excludes video/image model chatter (covered under Generative Media).
Qwen3-TTS updates: vLLM streaming in progress and a 25Hz control model teased
Qwen3-TTS (Alibaba Qwen): Qwen shared a short roadmap update covering near-term serving and controllability; the most concrete item is streaming inference work “with @vllm_project,” aiming for real-time output, per the Roadmap update, with community reaction focusing on the potential of streaming audio support in the Streaming support reaction.
• Streaming inference: The team says it’s actively being implemented with vLLM for smoother real-time TTS output, per the Roadmap update.
• Consistent voice identity: The recommended approach is using Voice Design to pick a target voice and then using the Base model’s clone feature as a fixed reference to reduce drift, per the Roadmap update.
• Instruct-style controls: Qwen says emotion/style control is planned for an upcoming open-source “25Hz model” release, implying higher control bandwidth than earlier variants, per the Roadmap update.
MiniMax releases M2-her for roleplay, with new message roles and 32k context
MiniMax M2-her (MiniMax): A new dialogue-first LLM optimized for immersive roleplay is being promoted as M2-her, with emphasis on “more immersion” and “longer coherence” in the launch chatter, per the Model teaser and the OpenRouter announcement.
The OpenRouter card highlights a 32,768-token context window and explicit support for additional message roles beyond the typical system/user/assistant pattern (notably user_system, group, and example-dialogue roles), positioning it for character consistency and multi-turn scenario adherence, per the OpenRouter model card. Access is linked directly from OpenRouter’s “chat with her” entry via the Chat link.
ERNIE 5.0 is now described as officially live, but with few concrete specs
ERNIE 5.0 (Baidu/PaddlePaddle): Following up on Release claim (reported release + circulating charts), PaddlePaddle now posts that “ERNIE 5.0 is officially live” in a brief status-style announcement, per the Official live post, echoed again without added technical detail in the Repeat post.
There aren’t enough surfaced artifacts in these tweets (pricing, endpoints, evals, or API docs) to pin down what changed operationally versus the earlier “reported released” chatter; the main new signal is the “officially live” framing itself, per the Official live post.
🧪 Reasoning & training ideas: ‘societies of thought’, scaling-law search, and energy-based models
Today’s research-heavy thread focuses on why reasoning emerges and how to train/search for it: multi-perspective internal debate, automated scaling-law discovery, and alternative reasoning architectures. Excludes benchmark-only papers and security papers (covered elsewhere).
Reasoning traces look like internal multi-agent debate, not monologue
Reasoning Models Generate Societies of Thought (Google): A new paper analyzes reasoning traces and argues that “reasoning” behavior may come less from longer chains-of-thought and more from implicit multi-agent-style interaction inside a single model—question/answer turns, perspective shifts, conflicts, and reconciliation—as summarized in the Paper screenshot thread.
The same writeup claims the effect shows up when comparing reasoning-tuned models (DeepSeek-R1, QwQ-32B) against instruction-tuned baselines, where the former exhibit higher “perspective diversity” and less one-sided monologue behavior, per the Paper screenshot thread.
Evolutionary agent searches scaling laws that extrapolate better
SLDAgent (Stanford/Tsinghua/Peking/Wizard Quant): A paper claims an evolution-style LLM agent can iteratively rewrite both the scaling-law formula and its parameter-fitting code, yielding better extrapolation to larger runs than hand-crafted human formulas, as described in the Scaling law agent summary. The arXiv entry referenced in the thread is available via the ArXiv paper.
The tweets emphasize the operational payoff as “fewer sweeps” for hyperparameters and better choices of which pretrained model to finetune from small trials, following the Scaling law agent summary.
Energy-based “Kona” pitches constraint-scored reasoning over next-token sampling
Kona (Logical Intelligence): A startup is pitching an “energy-based reasoning model” that scores candidate reasoning traces against constraints (minimizing “energy”) instead of generating token-by-token; the thread frames this as a path to fewer fluent-but-wrong answers and more constraint-consistent outputs, as summarized in Kona thread.
The claims in the tweets are qualitative (no shared eval artifact or model card in the dataset), but they clearly position “non-autoregressive, constraint-scoring” as the architectural differentiator, per Kona thread.
Non-English internal reasoning increases diversity in English answers
Language of Thought Shapes Output Diversity (SUTD): A new paper claims you can increase output diversity by forcing the model’s hidden “thinking” language to be non-English while still requiring the final answer in English, as summarized in Language of thought summary.
The thread also claims mixing multiple thinking languages broadens cultural/values coverage with little quality loss, and that languages “farther from English” help more, according to Language of thought summary.
Demis Hassabis frames AGI as needing continual learning, memory, and planning breakthroughs
AGI research direction (Google DeepMind): Demis Hassabis says AGI likely requires “1–2 new breakthroughs” such as continual learning, selective memory, smarter context windows, and long-term planning, while still expecting large foundation models to remain the core component, per the Hassabis clip.

The clip is high-level (no new technical mechanism specified), but it’s a clear prioritization signal around continual learning and planning as the missing pieces, as stated in the Hassabis clip.
RL training stack pattern: decouple trainer, inference engine, and environment servers
Atropos RL training stack (Atropos): A shared architecture diagram breaks RL into three services—an RL trainer, an inference engine (vLLM/sglang), and “environment servers” (math/game/toolcall)—connected via a trajectory API for batching tokens/masks/scores, as shown in Stack diagram.
This is a concrete blueprint for teams trying to scale multi-environment rollouts without entangling training code and serving code, as depicted in Stack diagram.
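A hedged sketch of the kind of trajectory record such a split implies; the field names are illustrative, not Atropos's actual API:

```python
from dataclasses import dataclass, field

@dataclass
class Trajectory:
    tokens: list[int]        # rollout token ids produced by the inference engine
    loss_mask: list[int]     # 1 where the trainer should apply the RL loss
    score: float             # reward returned by the environment server
    env: str = "math"        # which environment (math/game/toolcall) produced it

@dataclass
class TrajectoryBatch:
    items: list[Trajectory] = field(default_factory=list)

    def add(self, traj: Trajectory) -> None:
        self.items.append(traj)
```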
📄 Research papers: evals, security, and benchmark realism checks
A dense paper day: new benchmarks (video physics), agent security designs for computer-use, and evidence that AI detectors are unreliable. Excludes training/optimizer-centric papers (covered under Reasoning & training).
CaMeLs proposes system-level prompt-injection defenses for computer-use agents
CaMeLs Can Use Computers Too (paper): A system design for computer-use agents reduces prompt-injection by forcing a trusted planner to generate a full (branching) plan before the model sees any screen content, so untrusted on-screen text can’t rewrite instructions mid-run, as described in the CaMeLs paper summary.
• Security mechanism: The split between a trusted planner and an untrusted “screen reader” is meant to stop “hidden screen text” from taking over the task, per the CaMeLs paper summary.
• Performance trade: The thread claims up to 57% of strong-model OSWorld success is retained (and smaller models can improve by up to 19%) under the constraints, according to the CaMeLs paper summary.
• Residual risk: The paper still notes “branch steering” (UI changes that nudge the agent into a dangerous-but-valid plan branch), as highlighted in the CaMeLs paper summary.
This is one of the clearer “agent security is a system problem” proposals seen in CUAs, not a pure prompt trick.
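A hypothetical sketch of the control flow that pattern implies (names and structure are illustrative, not the paper's code): the planner never sees screen content, and the executor can only pick among branches the plan already contains.

```python
from dataclasses import dataclass

@dataclass
class PlanStep:
    action: str                 # e.g. "click", "type", described from the task alone
    branches: dict[str, int]    # observed condition -> index of a pre-approved next step

def run_task(task: str, planner, executor, screen) -> None:
    plan: list[PlanStep] = planner(task)   # trusted: sees only the task text, emits a full branching plan
    i: int | None = 0
    while i is not None and i < len(plan):
        step = plan[i]
        observation = screen.read()           # untrusted on-screen content enters here
        screen.act(step)                      # carry out the pre-planned action
        branch = executor(step, observation)  # untrusted model only labels which branch applies
        i = step.branches.get(branch)         # anything outside the plan simply ends the run
```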
Physics-IQ finds today’s video generators still fail basic physics checks
Physics-IQ (Google DeepMind + INSAIT): A new benchmark argues that photorealistic generative video is weak evidence of a model having a usable “world model”; it measures whether a model can continue real videos while preserving motion/physics constraints across domains (fluids, optics, solid mechanics, etc.), and reports that even the most visually convincing models score far below a “two real takes agree” ceiling, as summarized in the Physics-IQ writeup.
• Benchmark design: The test shows a few seconds of a real event, asks the model to predict the next seconds, and checks motion consistency (where/when/how much things move), per the Physics-IQ writeup.
• Key empirical claim: “Visual polish and true understanding are uncorrelated” (Sora called out as hard to distinguish from real while still scoring low on physics), as stated in the Physics-IQ writeup.
The practical read is that evals for video agents and robotics simulators need explicit physics metrics, not “looks real” human preference alone.
ARC Prize 2025 report: progress comes from propose-check-revise loops
ARC Prize 2025 Technical Report: The ARC-AGI competition writeup frames the main winning pattern as a refinement loop—propose an answer, score it with a feedback signal, revise, repeat—rather than a single-shot “just prompt it” approach, as summarized in the ARC report summary.
• Method family: The report surveys systems that search over tiny programs (keep edits that score better) and systems that do test-time training while solving, per the ARC report summary.
• Score context: It repeats that the best 2025 entry reached 24% on a hidden test set, while humans solve almost all tasks, according to the ARC report summary.
• Benchmark hygiene warning: It explicitly flags potential dataset leakage (models absorbing ARC-style patterns), stressing that the benchmark must evolve, as noted in the ARC report summary.
For engineers, it’s another data point that “self-correction wrappers” are becoming table stakes for brittle reasoning tasks.
LLM-based AI detectors collapse under small style changes
LLM self-detection in education (paper): A computing-education study finds LLMs are unreliable “AI-written vs human-written” judges; they mislabel human answers as AI up to 32%, and a minimal “write like a human” instruction makes detection accuracy collapse—e.g., Gemini outputs fool GPT-4 100% of the time in one setup, per the detector failure summary.
• Why this matters operationally: The paper argues false positives and false negatives make LLM-judge-based misconduct decisions unsafe in high-stakes settings, as stated in the detector failure summary.
• Adversarial ease: It claims “write like a human” (casual, imperfect) is sufficient to evade “self-detection” and even causes Gemini to “fool itself” 96% of the time, according to the detector failure summary.
This reinforces that provenance needs cryptographic or platform-level signals; style-based judging is too brittle when the generator can target the judge.
NBER paper: most AI-exposed workers can adapt, but clerical roles stand out
AI exposure + adaptability (NBER paper): A new approach combines task exposure with a worker “adaptability” score (savings, transferable skills, local job options, age) and concludes that the most vulnerable subgroup is clerical/administrative support—about 4.2% of workers (~6.1M people)—as summarized in the NBER paper thread.
• Metric shift: The claim is that “exposure alone” overstates displacement risk; adding buffers changes which occupations look fragile, per the NBER paper thread.
For AI leaders, this is a concrete segmentation framework for where “automation narrative” translates into real transition risk, rather than a broad occupational doom score.
🛡️ Security & safety: agent hijacks, platform privacy, and abuse mitigations
Security today is mostly about practical risks when agents can act: prompt injection, credential exposure, and OS/platform data access. Excludes the Clawdbot ops wave itself (feature) while still covering general mitigations.
CaMeLs: plan-before-screen design reduces prompt injection, but branch steering remains
CaMeLs computer-use security (arXiv): A new design pattern for computer-use agents splits a trusted planner from an untrusted screen-reading executor, forcing the plan to be written before the model sees the screen—aiming to prevent on-screen prompt injection from rewriting intent, as summarized in the paper thread.
• Reported results: The thread claims this preserves up to 57% of strong-model success on OSWorld while boosting small models by up to 19%, according to the same paper thread.
• Residual risk: Even with plan-first, “branch steering” remains—attackers can alter UI affordances so the executor chooses a harmful but plan-consistent branch, again as described in the paper thread.
Windows 11 BitLocker key access becomes a real-world privacy threat model item
Windows 11 / BitLocker (Microsoft): A Windows Central report says Microsoft confirmed it can provide BitLocker recovery keys to the FBI when served a legal order, with the practical enabler being Windows 11’s forced online-account flow for many users, as described in the privacy claim and detailed in the Windows Central report.
For teams relying on BitLocker as “local device secrecy,” this shifts the risk model from purely endpoint compromise to also include account-escrow and legal-process pathways, especially for laptops used in AI engineering workflows (repos, local eval data, API keys).
ACIP publishes a one-liner to harden agent setups against prompt injection
ACIP (Dicklesworthstone): The ACIP project is being promoted as a practical hardening layer for agent runners against prompt-injection attacks, with a one-liner installer that sets ACIP_INJECT=1 and ACIP_SELFTEST=1, as shown in the ACIP integration post and the one-liner snippet.
The emphasis is on making prompt-injection resistance an installable “baseline,” rather than a bespoke prompt discipline per workflow, with code and integration details in the GitHub integration directory.
Credentialed agents push teams back to blast-radius isolation basics
Agent runner isolation (Practice): As people wire agents into email, calendars, and other privileged services, a recurring safety argument is to isolate the agent on a separate machine/VPS to reduce the damage from prompt injection or mis-executed tool calls; the risk framing is explicit in the credential exposure warning, and the “why do I need a separate box?” questions show up in the separate machine question and the follow-on blast radius debate.
The underlying issue is that once an agent can read inbound messages and execute local actions, a single malicious email/DM (or a brittle tool edge case) can become an escalation path to destructive operations under the user’s identity.
📚 Docs & devex surfaces for agents: markdownification and new tool affordances
Today’s docs/devex angle is about making artifacts consumable by agents (repo→markdown) and new assistant-side tooling surfacing in the wild. Excludes repo-local prompts/rules (covered under Workflow patterns).
codebase.md traffic shows crawler storms dwarf human use
codebase.md (Codebase): A real usage snapshot suggests “repo→Markdown for agents” services can become crawler magnets; codebase.md saw 771.5k crawler hits vs 9.7k humans over 30 days, plus 14k agent-like requests, and the operator calls out Amazonbot at ~408k requests/month despite Cloudflare blocking “AI crawlers,” per the Traffic breakdown and the tool page linked in Tool page.
This matters because the doc-surface itself becomes the bottleneck: once you publish an agent-consumable representation of code, you may need bot-rate-limiting, caching, and possibly paid gates just to keep the service stable under automated fetchers.
ChatGPT surfaces a “container.download” tool in a reasoning trace
ChatGPT (OpenAI): A user noticed ChatGPT referencing a container.download tool directly inside a “thinking trace,” implying a new or newly-exposed affordance for fetching remote files into the sandbox/container environment, as shown in the Thinking trace.
This is a devex signal more than a product announcement: it hints at tool availability that may vary by model/plan/runtime, and at a discoverability gap where capabilities appear first as internal tool mentions rather than documented surfaces.
💼 Funding & enterprise moves: distribution, monetization, and talent acquisitions
Business signals today focus on how AI products monetize and distribute (ads/commerce, Android distribution), and enterprise positioning vs consumer. Excludes pure infrastructure capacity (covered under Infrastructure).
Ads thesis: OpenAI needs leverage beyond 1:1 compute-to-revenue scaling
ChatGPT monetization (OpenAI): Following up on Ads framing (value-sharing + ad-test hints), a new analysis argues OpenAI’s revenue has effectively scaled ~1:1 with compute (illustrated as ~1GW ≈ $10B), and that ads are the cleanest way to increase “revenue per GW” without pricing users out, as laid out in the Revenue leverage chart.
The same chart models a wide band of outcomes depending on the ad/subscription mix (Spotify-like vs YouTube-like) and ties the upside to advertiser-funded “price increases” rather than raising subscription prices directly.
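As a rough illustration of the “revenue per GW” framing, the back-of-envelope below uses the chart’s ~$10B-per-GW baseline and hypothetical ad-mix multipliers; the specific ratios are made up for illustration, not taken from the analysis.

```python
# Illustrative arithmetic only: ads modeled as a multiplier on the ~$10B/GW baseline.
BASE_REVENUE_PER_GW = 10e9  # ~$10B of subscription-style revenue per GW (from the chart)

def revenue_per_gw(ad_uplift_ratio: float) -> float:
    """Total revenue per GW if ads add `ad_uplift_ratio` on top of the baseline."""
    return BASE_REVENUE_PER_GW * (1 + ad_uplift_ratio)

for label, ratio in [("Spotify-like (ads minor)", 0.15), ("YouTube-like (ads dominant)", 1.5)]:
    print(f"{label}: ~${revenue_per_gw(ratio) / 1e9:.0f}B per GW")
```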
Amodei argues enterprise-first helps Anthropic avoid the “engagement trap”
Anthropic (Enterprise strategy): Dario Amodei argues Anthropic deliberately prioritized selling to businesses to avoid consumer incentives that maximize engagement, produce “slop,” and create a competitive “death race,” as stated in the Enterprise strategy clip.

The implication is a business model that optimizes for workflow outcomes (accuracy, reliability, compliance) rather than time-spent metrics—useful context for how Anthropic may weigh product decisions and safety trade-offs.
Gemini’s traffic share jumps to 22%, reinforcing “distribution wins” narrative
Gemini (Google) distribution: A Similarweb-style chart claims Gemini’s share of “Generative AI Traffic” rose from 5.3% to 22% over the last 12 months, as shown in the Traffic share chart.
• Distribution as the lever: The post frames the growth as primarily driven by distribution rather than purely model quality, as argued in the Traffic share chart.
• Monetization follow-on concern: A reply flags that ads inside ChatGPT-style products can become an easy monetization wedge once you have the default position, per the Ads follow-up comment.
Treat the exact shares as directional (methodology isn’t in the tweets), but the pattern is clear: default-placement distribution is showing up as a measurable advantage.
DeepMind hires Hume AI leadership under a licensing agreement
Google DeepMind (Talent acquisition): DeepMind is hiring Hume AI’s CEO Alan Cowen plus “several top engineers” under a new licensing agreement, per the Wired article screenshot.
This looks like a targeted bet on “emotionally intelligent” voice interfaces—less about generic TTS, more about affect detection/expressive voice UX and the product layer that wraps it.
🎥 Generative media: Veo 3.1 promos, motion control, and realtime style transforms
Creator tooling is active today: Veo/Kling/Krea workflows and claims about anime-quality video generation. Excludes non-media models (covered under Model Releases).
Google’s Veo 3.1 update spotlights vertical video, consistency, and upscaling
Veo 3.1 (Google): Google published an “Ingredients to Video” update for Veo 3.1 that emphasizes native vertical generation, improved character/background consistency, and upscaling to 1080p and 4K for mobile-first workflows, as summarized in the weekly brief and detailed in the Product blog post.
This matters mainly as a format and post-processing shift: vertical-first output reduces the amount of downstream editing teams do for Shorts/Reels-style distribution, and better subject consistency is the failure mode that currently burns the most iteration time.
Kling 2.6 Motion Control: reference video + character image as a repeatable workflow
Kling 2.6 (KlingAI): A creator workflow is getting repeated: provide one character image and one reference video, then let Kling 2.6 “Motion Control” transfer timing/body movement/facial expressions while keeping character identity stable, per the workflow thread.

• Why builders care: this is effectively “motion as conditioning,” which can reduce prompt trial-and-error when you need a very specific performance (especially facial beats) and you already have a reference clip.
Krea’s real-time editing demos show live style transforms, including anime looks
Krea (Krea AI): Demos of Krea’s real-time editing show an interactive loop where the output updates continuously as you type (e.g., “cute robot” → “chibi”), as shown in the live typing demo; claims like “you can watch any show in anime now” build on the same real-time editing capability in the anime montage.

This matters operationally because “live” editing changes the iteration cadence: it’s closer to a controllable UI tool than a batch T2I/T2V job queue.
“Text-to-video anime is indistinguishable” claims keep resurfacing
Text-to-video sentiment: A recurring claim is that current text-to-video outputs can be “indistinguishable from what anime labs create,” paired with short showcase clips as evidence in the anime quality claim.

The main signal here is expectation-setting: as more teams treat anime as a reachable quality bar for short-form clips, the differentiator shifts toward controllability (shot planning, character continuity, editability) rather than single-clip fidelity.
Higgsfield pushes Veo 3.1 “MAX” (4K) with a time-boxed discount
Veo 3.1 on Higgsfield (Higgsfield): Higgsfield is promoting Google Veo 3.1 access in “MAX quality” with native 4K, portrait/landscape support, stronger prompt adherence, native audio, and “clean transitions,” framed as an 85% off time-boxed deal in the promo announcement.

• Pricing/ops signal: the push is explicitly structured around credits + promo codes and a “2-year access” upsell, as described in the promo code post and the 2-year offer.
Treat it as a distribution/packaging move—no independent quality/eval artifact is shared in these tweets beyond marketing claims.
Comfy Cloud ships 3D camera control + multi-angle Qwen edit LoRA building blocks
ComfyUI (Comfy Cloud): Comfy Cloud added a 3D Camera Control node and made a Qwen Image Edit 2511 multi-angle LoRA available, per the availability note.
This is a small but concrete pipeline building block: it’s the kind of node/LoRA pairing that teams use to standardize multi-angle consistency and camera motion control inside a Comfy graph without rewriting custom nodes.
🤖 Robotics & embodied AI: VLA model maps and humanoid capability demos
Robotics tweets are a mix of model taxonomy and real-world humanoid demos, useful for tracking where VLA/VLA+ is heading. Excludes “world model” philosophy clips unless tied to a specific robotics artifact.
A practical map of VLA models and where “VLA+” is heading
VLA taxonomy (TheTuringPost): A curated list of “8 most illustrative” vision-language-action (VLA) models is circulating as a quick orientation layer for embodied-AI builders, spanning Gemini Robotics, π0, SmolVLA, Helix, ChatVLA‑2 (MoE), ACoT‑VLA, VLA‑0, and Microsoft’s Rho‑alpha (ρα), as compiled in the VLA model roundup. The post also points to build resources via the linked build guide page, which makes it easier to translate paper names into runnable stacks.
• Why this matters: It’s a concrete “model map” for teams deciding whether they’re betting on policy-only action heads vs action CoT vs MoE-style routing, without mixing it into broader “world model” philosophy. It’s also a useful checklist for competitive analysis when you’re evaluating robotics vendors or planning internal prototypes.
Agile Robots demo highlights tactile + navigation as product features
Agile Robots (Agile Robots): A short industrial-oriented demo is being shared emphasizing 71 degrees of freedom, precision tactile sensors, and autonomous navigation, positioning the system for practical deployment rather than lab-only benchmarks, as described in the industrial robot demo.

• Signal for builders: The emphasis on tactile sensing and autonomy suggests the near-term differentiation isn’t just better VLMs; it’s sensor + control integration and reliability under constrained tasks (pick/place, insertion, handling variability).
Unitree G1 dance clips keep becoming the default “humanoids are here” demo
Unitree G1 (Unitree): A short synchronized dance demo from two Unitree G1 humanoids is making the rounds again, with the cleanest clip in the G1 dance routine. A separate post wraps the same kind of footage under Demis Hassabis’s “bubble-like” investing remark, implying the capability demos are now decoupled from any single lab’s narrative, as shown in the bubble comment with robots.

• What to take from it: The clip is less about choreography and more about repeatable whole-body control + timing in a commodity-looking platform; it’s the sort of demo customers will benchmark every other humanoid against, even when it’s not representative of manipulation work.
A Tesla Optimus hallway-walk photo becomes a shorthand for humanoid ubiquity
Optimus (Tesla): A single still of Tesla Optimus walking down a hallway is getting reshared with the framing that people will see “a million of these,” as posted in the Optimus hallway photo.
• Why it lands: It’s a distribution/mindshare signal more than a technical update—once these visuals become boring, the bar for “credible humanoid progress” shifts from staged demos to repeatable, deployed tasks.