Releases

June 2026

v0.8.12

Overall polish to the everyday feel of the assistant: in-progress steps inside chats show a live timer, and hovering over the assistant's thinking reveals how long it reasoned; you can search inside a long chat to jump to a specific word or moment; messages stream more smoothly and stay ordered even when the connection stutters; and a new Balanced economy profile gives you a middle-ground option that keeps costs down without giving up much quality.

  • Sign in through your browser. Logging in now hands you off to your normal browser (Safari, Chrome, whatever you use) and brings you back. Small change, but your password manager, saved logins, and two-factor codes all work the way they normally do.
  • See how long things take. In-progress steps show a live timer so you know the assistant is working. Hover over its "thinking" to see how long it spent reasoning. Less guessing about whether it's stuck.
  • Find within a conversation. Search inside a long chat to jump to a specific word or moment.
  • Smoother streaming. Messages hold together more reliably even if your connection stutters. Fewer replies arriving out of order, duplicating, or vanishing.
  • Balanced economy profile. A new budget-friendly profile that keeps costs down without giving up much quality: a middle ground between cheapest and best.

v0.8.11

This release shows web searches inside chats as clean, clickable cards you can verify, adds MiniMax M3 as a new model option, and refreshes the skill lineup with a new geo-audit skill, a rebuilt app builder, and document writing that consistently uses the proper editor. Plus, opening the Mac desktop app now installs the Vellum CLI into your shell PATH automatically.

  • MiniMax M3 added as an option.
  • Web searches are easier to follow. When the assistant looks something up, results appear as clean, clickable cards in the conversation. See what it searched and click straight to the sources.
  • New geo-audit skill for location and SEO analysis, now in the catalog.
  • App builder rewrite. The skill that builds dashboards, simple tools, and pages was rebuilt to be more reliable, so what you ask for comes out working more consistently.
  • More reliable document writing. Ask for something substantial and it now reliably uses the proper editor, giving cleaner, better-formatted results.
  • The Mac desktop app now installs the Vellum CLI into your shell PATH automatically, so running vellum from any Terminal window works without a separate install step. The wrapper points at a locator file that the app refreshes on every launch, keeping vellum message, vellum upgrade, and shell completions available and pointing at the live CLI binary.

v0.8.9

This release lets Pro subscribers bundle credits directly into their plan, makes the daily heartbeat calmer by default, gives scheduling a friendlier builder with plain-language descriptions and a tidier list that hides past one-time tasks and system jobs, and lets you download all your invoices, including Pro and ad-hoc credit purchases, as a single zip.

  • Pop conversations into their own windows. Open any conversation in its own window so you can keep one chat visible while working in another. There's also a new "Quick Input" window: a small box to dash off a message without opening the full app or losing your place.
  • Keyboard shortcuts. Set your own system-wide hotkeys that work no matter what app you're in. Press one and the assistant pops up, or dictation starts, instantly.
  • Native notifications. Alerts now arrive as proper macOS notifications with buttons you can click to act, without opening the app first.
  • A cleaner icon and lots of polish. A sharper app icon plus many small visual fixes so it looks at home on macOS.
  • Easier scheduling. A friendlier builder, with a plain-language description of when it'll actually run ("every weekday at 9am") so you're not second-guessing the timing.
    • Tidier schedules page. Old one-time tasks are collapsed and system jobs are tucked away, so you see the schedules you care about.
  • Calmer by default. The heartbeat now runs at most twice a day out of the box, and you can pause or turn it off whenever. Less noise, more control.
  • Clearer credit purchasing. You can now bundle credits as part of your Pro plan, and the Pro credit screen is cleaner and shows your base monthly fee up front. No surprises.
  • Download all invoices at once in a single zip instead of clicking through one at a time. Includes invoices for Pro plan and ad hoc credit purchases.
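
The plain-language schedule description in the scheduling builder can be illustrated with a small sketch. This is not Vellum's implementation: `describe_cron` is a hypothetical helper, and it assumes a five-field cron expression with a fixed minute and hour and a weekday field of either `*` or `1-5`.

```python
def describe_cron(expr: str) -> str:
    """Describe a narrow subset of 5-field cron expressions in plain language.

    Hypothetical helper for illustration only; not part of any Vellum API.
    """
    fields = expr.split()
    if len(fields) != 5:
        raise ValueError("expected 5 cron fields: minute hour day month weekday")
    minute, hour, day, month, weekday = fields
    if not (minute.isdigit() and hour.isdigit()) or day != "*" or month != "*":
        raise ValueError("unsupported expression in this sketch")

    h, m = int(hour), int(minute)
    if not (0 <= h <= 23 and 0 <= m <= 59):
        raise ValueError("hour or minute out of range")

    # Render a 12-hour clock time such as "9am" or "2:30pm".
    suffix = "am" if h < 12 else "pm"
    h12 = h % 12 or 12
    time_text = f"{h12}{suffix}" if m == 0 else f"{h12}:{m:02d}{suffix}"

    # Only two weekday patterns are recognized in this sketch.
    cadence = {"*": "every day", "1-5": "every weekday"}.get(weekday)
    if cadence is None:
        raise ValueError("unsupported weekday field in this sketch")
    return f"{cadence} at {time_text}"


print(describe_cron("0 9 * * 1-5"))  # prints "every weekday at 9am"
```

A real builder would need to cover lists, steps, and time zones; the point here is only that showing the rendered sentence next to the raw expression removes guesswork about when a task runs.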

v0.8.8

  • Cross-machine device pairing: vellum pair mints a device-scoped, revocable token on the assistant host and prints a pairing bundle, vellum connect import sets up a persistent paired assistant on another machine, vellum client --token opens an ephemeral session, and vellum devices and vellum unpair list, revoke, and forget pairings.
  • Richer chat surfaces: new work-result, choice, and copy surfaces, plus managed-OAuth connect and approval surfaces so the assistant can hand you structured results and connection prompts inline.
  • Pro plan credit bundles: pick a credit bundle during upgrade and change flows, with the current bundle shown on the billing plan card.
  • In-chat find: the Edit menu now has Find (Cmd+F) wired to in-chat search on both macOS and web.
  • Streaming made visible: thinking blocks render live in the chat transcript, a spinning ring wraps your custom avatar while the assistant is working, and the Dock and menu-bar icons reflect the assistant avatar with a live status indicator.
  • Skill browsing: skills carry category metadata and SVG icons in their SKILL.md frontmatter, rendered in the web app with dynamic categories.
  • Library app polish: apps open fullscreen on web and mobile, with an Edit button in the Library app viewer.
  • Cloudflare tunnel provider: new tunnel option for exposing a local assistant.
  • Model profile polish: advanced params shown only after a model is chosen, the un-pinned params now read "Default" instead of "Inherit," and advanced-param spacing tightened up.
  • Web search sources: clamped sources expand behind a "+N more" pill.
  • Connection reliability: SSE reconnects are self-diagnosing with single-cursor resumable replay, and truncated thinking blocks heal on reconnect.
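
As a rough illustration of single-cursor resumable replay, the sketch below tracks one cursor (the id of the last event applied) and ignores anything at or below it when the server replays after a reconnect. The class name, the integer event ids, and the assumption that ids increase monotonically are all hypothetical; the notes above do not describe Vellum's actual wire format.

```python
from dataclasses import dataclass, field


@dataclass
class ReplayCursor:
    """Tracks one cursor: the id of the last event applied to the transcript."""

    last_applied: int = 0
    transcript: list[str] = field(default_factory=list)

    def resume_from(self) -> int:
        # Value a client would send on reconnect (for SSE, typically via the
        # Last-Event-ID request header) so the server replays later events.
        return self.last_applied

    def apply(self, event_id: int, text: str) -> bool:
        """Apply an event once; return False for replayed duplicates."""
        if event_id <= self.last_applied:
            return False  # already applied before the disconnect
        self.transcript.append(text)
        self.last_applied = event_id
        return True


cursor = ReplayCursor()
cursor.apply(1, "Hel")
cursor.apply(2, "lo")
# Connection drops here; after reconnecting, the server replays from id 2.
for event_id, text in [(2, "lo"), (3, " world")]:
    cursor.apply(event_id, text)
print("".join(cursor.transcript))  # prints "Hello world"
```

Because there is only one cursor, duplicates are dropped with a single comparison. This sketch does not detect gaps in the id sequence, which a production client would also want to handle.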

v0.8.7

Highlights

  • Improved reliability of web release builds, ensuring shared packages are correctly installed before the build process runs
  • Fixed a configuration issue that caused web release builds to not properly enable platform mode, resolving potential runtime errors in the web application

Build: 0.8.7 Commit: cfe9507d34 Built at: 2026-06-03 01:43:06 UTC

v0.8.6

A model and reliability update: Claude Opus 4.8 joins the hosted catalog, Gemini 3.5 Flash is now available to cloud users after a missing platform rate card was blocking requests, the sleep policy gains 1-hour and 3-hour idle presets, per-assistant sleep thresholds now survive wake cycles, inbound email is fixed on Resend-registered domains, disk-usage alerts fire an owner email, and API key rotation gets a propagation grace period.

Highlights

  • Claude Opus 4.8 now available: Anthropic's latest Opus model is live in the hosted catalog and selectable across all surfaces
  • Gemini 3.5 Flash on the hosted service: the model was already selectable in the local UI but a missing platform-side rate card returned a 400 for every cloud request; the rate card and billing entry are now wired in so cloud users can select it without hitting a rejection
  • Sleep policy 1-hour and 3-hour presets: two shorter idle-sleep durations join the sleep policy control in Settings alongside the existing 1-day, 3-day, and 7-day options so assistants can be tuned to sleep faster without using the full daily minimum
  • Idle-sleep thresholds persist across wake cycles: per-assistant sleep cutoff settings were being reset to the system default after each wake event; the configured threshold now survives correctly
  • Disk-usage email notifications: when workspace storage crosses the alert threshold the assistant owner now receives an email with remediation links to upgrade storage or ask the assistant to clean up, alongside the existing in-app error notification
  • API key rotation grace period: a short overlap window during key rotation keeps the old key valid while the new key propagates, eliminating the authentication gap that could cause brief request failures on self-hosted instances
  • Email inbound receiving fixed on Resend domains: newly registered Resend domains had receiving silently disabled because the create-domain API call was missing the nested capabilities object; inbound replies were being dropped with no error surfaced
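
The API key rotation grace period can be pictured as a validator that accepts the previous key for a short overlap window. The following is a minimal sketch under stated assumptions: `RotatingKey`, its fields, and the 60-second window in the usage lines are hypothetical and chosen for illustration, not taken from Vellum's implementation.

```python
import hmac
import time
from dataclasses import dataclass
from typing import Optional


def _same(a: str, b: str) -> bool:
    # Constant-time comparison to avoid leaking key contents through timing.
    return hmac.compare_digest(a.encode("utf-8"), b.encode("utf-8"))


@dataclass
class RotatingKey:
    """Holds the current key plus the previous one during a grace window."""

    current: str
    previous: Optional[str] = None
    previous_expires_at: float = 0.0

    def rotate(self, new_key: str, grace_seconds: float, now: Optional[float] = None) -> None:
        now = time.time() if now is None else now
        self.previous = self.current
        self.previous_expires_at = now + grace_seconds
        self.current = new_key

    def is_valid(self, presented: str, now: Optional[float] = None) -> bool:
        now = time.time() if now is None else now
        if _same(presented, self.current):
            return True
        # The old key stays valid only while the new one propagates.
        if self.previous is not None and now < self.previous_expires_at:
            return _same(presented, self.previous)
        return False


keys = RotatingKey(current="old-key")
keys.rotate("new-key", grace_seconds=60, now=1000.0)
print(keys.is_valid("old-key", now=1030.0))  # True: inside the grace window
print(keys.is_valid("old-key", now=1061.0))  # False: window has closed
```

The overlap trades a slightly longer lifetime for the old key against the request failures that would otherwise occur while the new key propagates.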

v0.8.5

A billing and reliability release: MiniMax is now an OpenAI-compatible provider, self-hosted assistants route runtime calls directly to their own ingress, billing and resize flows get confirmation steps and clearer copy throughout, email and domain settings gain a live verification chip and confirmation modals, the CLI gains tab management and roadmap CRUD, and the avatar editor learns to generate from a prompt or reset to default.

Highlights

  • MiniMax is now available as an OpenAI-compatible provider: configure MiniMax models through the OpenAI-compatible provider flow with primary and fallback endpoint validation, with macOS thinking UI and conversation title generation updated to recognize MiniMax's <think> reasoning tags so they render in the collapsible thinking surface and never bleed into auto-generated titles
  • Self-hosted assistants route runtime calls directly to their own ingress: when you connect a self-hosted assistant from the web app, message and event calls go straight to your assistant with the platform actor token attached, instead of being proxied through the cloud platform
  • Billing and resize flows are clearer and safer: tier upgrades go through a read-only confirmation step before submitting to Stripe, plan resize triggers a confirmation modal after the Stripe portal step, adjust-plan failures surface inline instead of failing silently, downgrades show resource changes in read-only resource cards with an inline downgrade path, and the Compute & Resources card uses a unified tile layout with contextual resize CTAs
  • Email and domain settings are easier to work with: domain settings show a live verification status chip, domain registration and deletion go through confirmation modals, the managed email upsell auto-opens the upgrade modal, the domain status CLI is wired to the live verification endpoint, and the assistant handle auto-locks when it matches the subdomain
  • Disk pressure and credits banners surface action: disk pressure banners are redesigned with clearer state, and the credits exhausted banner now links directly to auto top-ups
  • Multi-tab management from the CLI: list, open, switch, and close conversation tabs with new assistant tabs subcommands, with each connected client tracking its own tab scope independently
  • Roadmap from the CLI: the assistant roadmap command now supports create, search, and upvote alongside listing, so you can manage roadmap items end-to-end without leaving the terminal
  • Avatar editor gains Generate with AI and Reset to default options, so you can spin up an avatar from a prompt or revert to the original without leaving the editor
  • Composer emoji aliases: :huff and :steam now resolve to 😤 alongside the existing alias set
  • Chat reliability fixes: the thinking indicator no longer survives stale terminal events because send state is now scoped to the active turn, and macOS conversation list pagination keeps the scheduled section from being truncated on large histories
  • Share Feedback dialog polish: cleaner layout, design library components throughout, and contextual tips, so reporting an issue feels less like filling out a form
  • LaunchDarkly references removed from the repo, with the assistant fully moved to push-driven feature flag invalidation via SSE so flag changes take effect immediately instead of waiting on a polling loop
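
To show what recognizing `<think>` reasoning tags can look like, here is a small sketch that separates reasoning from visible text, so reasoning can be shown in a collapsible surface and kept out of generated titles. `split_reasoning` is a hypothetical helper, and it assumes complete, non-nested `<think>...</think>` pairs; it is not the parser used in the app.

```python
import re

# Non-greedy match so several separate <think> blocks are handled independently.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw: str) -> tuple[str, list[str]]:
    """Return (visible_text, reasoning_blocks) for a raw model response."""
    reasoning = [block.strip() for block in THINK_BLOCK.findall(raw)]
    visible = THINK_BLOCK.sub("", raw).strip()
    return visible, reasoning


visible, reasoning = split_reasoning(
    "<think>User wants a short title.</think>Trip planning for Lisbon"
)
print(visible)    # prints "Trip planning for Lisbon"
print(reasoning)  # prints "['User wants a short title.']"
```

A streaming renderer would also need to cope with a `<think>` tag whose closing tag has not arrived yet, which this sketch leaves out.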

v0.8.4

An identity and model-catalog release: every assistant gets a unique handle that doubles as its subdomain, the document editor gains agent-driven comments and find-and-replace, the model picker adds Gemini 3.5 Flash, Gemini 3.1 Flash-Lite GA, Grok 4.3, and Owl Alpha, OpenAI-compatible providers can finally point at any endpoint on macOS, and a new /clean slash command and a dock-icon fix tighten the day-to-day feel.

Highlights

  • Unique assistant handles and matching subdomains: every assistant now has a unique handle that doubles as its subdomain by default, the handle is editable inline from About Assistant, the subdomain locks to match it, and existing assistants were backfilled so handles are non-nullable across the system
  • Document editor agent tools: the assistant can now leave inline comments on documents and use grep and sed-style find-and-replace tools, so iterative edits run as tool calls instead of full rewrites
  • Custom email address builder in Pro onboarding: the Pro onboarding domain step now includes an inline email-address builder so you can configure a custom email domain end to end during signup, and the new assistant domain register --email-username flag does the same from the CLI
  • Expanded model catalog: Gemini 3.5 Flash and Gemini 3.1 Flash-Lite GA via Google, Grok 4.3 and Owl Alpha via OpenRouter, and Fireworks now respects per-model reasoning_effort ceilings so deep-thinking models stop getting clipped by a global cap
  • OpenAI-compatible providers on macOS: the provider connection editor now exposes Base URL and Models fields so you can point macOS at any OpenAI-compatible endpoint (vLLM, Ollama, LM Studio, llama.cpp) without leaving Settings
  • New /clean slash command: strips runtime injections from the conversation without summarizing the rest of the turn, available in the web chat composer slash picker and on macOS
  • Inline file attachments in assistant message bubbles on macOS: file chips now render between paragraphs in assistant replies instead of all batching at the bottom of the message
  • macOS performance and reliability: dock icon image processing moves off the main thread to eliminate 2s+ AppHang events, self-signed certificate creation no longer duplicates on every build, and the home feed preserves read-row text contrast on dark backgrounds

v0.8.3

A home feed and notifications intelligence release: the macOS home page now separates urgent from routine activity, the CLI gains full schedule management, memory routing is on by default, Fireworks joins the managed provider roster, and the web platform recognizes self-hosted assistants.

Highlights

  • Home feed redesigned with urgency routing: urgent items go to Inbox, routine items to Activity, every row shows timestamps and unread weight, read rows fade back, and the Activity section collapses to keep your Inbox focused
  • Notification urgency is now first-class: urgent signals get OS banners and Inbox placement, routine signals route to Activity silently, and assistant notifications send --urgent lets your own scripts control urgency
  • Full schedule management from the CLI: create, cancel, enable, disable, and delete scheduled tasks with new assistant schedules subcommands, for complete CRUD without opening the app
  • Memory v2 router is on by default: the Sonnet-based memory router now activates for managed Anthropic installs with a balanced profile, no feature flag needed, retrospective timestamps reflect your local timezone, and transcripts use participant display names
  • Fireworks is now a managed connection: use any Fireworks model without supplying your own API key, the same way managed Anthropic, OpenAI, and Gemini connections work
  • Self-hosted assistants are now recognized in the web app: if you are running the open-source daemon locally, the platform shows a dedicated screen with a link to settings instead of attempting to start a cloud instance

v0.8.2

A cross-platform polish release: subagents get a cleaner design and iOS gets a critical ChatComposer reliability fix.

Highlights

  • Subagent UI redesigned to match macOS: cleaner layout, full-screen detail panel on mobile, and consistent design language across platforms
  • iOS ChatComposer reliability fix: resolved an infinite update loop triggered by WKWebView re-dispatched input events
  • Usage page redesigned with clearer information hierarchy, descending-sorted stacked-bar tooltips, and improved aesthetics

v0.8.1

A release focused on a redesigned model profiles and provider connections system, Twilio phone calling for platform-hosted assistants, one-click Publish to Vercel from the app builder, iOS push notifications, a smarter memory pass, and a tightened iOS login flow.

Highlights

  • Redesigned model profiles and provider connections: mix managed and bring-your-own connections per profile, edit labels and status inline, save tweaks as new variants, and switch profiles mid-conversation with the new /model slash command
  • Twilio phone calling for platform-hosted assistants: your assistant can now answer and place real phone calls when running on the Vellum platform, so you can reach it from anywhere without opening an app
  • Publish to Vercel from the app builder: when your assistant builds you an app, it can now deploy it to Vercel for you in one step, with complex apps handed off to the assistant for end-to-end deployment and live URLs surfaced in the deploy toast
  • iOS push notifications are live: with the latest staging iOS app, your assistant can send native APNs notifications to your phone for activity completions, scheduled wakeups, and turn-end events you'd otherwise miss on mobile
  • Memory v2.5 under the hood: a background retrospective pass reflects on recent conversations, a Sonnet-based router replaces dynamic context injection for memory recall, and per-conversation memory state forks cleanly with the conversation. No visible UX changes, just memory that feels right more often
  • iOS login flow polish: a single Sign In button on the Capacitor login screen, native OAuth completion now lands through a deep-link router instead of falling back to popups, conversational OAuth opens in SFSafariViewController, invalid_grant errors prompt a reconnect instead of silently failing, and the onboarding name is pre-filled from your authenticated profile
  • Tavily joins the web search provider list, giving you another option alongside the existing providers
  • Quality-of-life polish: a Discord community card and chat banner make it easier to find the Vellum community, assistant changelog brings release notes into the CLI, NOW.md injection is now configurable, the macOS provider editor gained a Save as New button, Google OAuth setup is now single-step with inline credential collection, and gateway logs are pretty-printed with a JSONL sidecar for log-tail

v0.8.0

A major intelligence and cross-platform release: Memory v2 is now live, the web app can reach back to your Mac for local actions, managed integrations expanded, and the iOS experience is smoother across login, settings, attachments, and notifications.

Highlights

  • Memory v2 is live: your assistant now builds richer concept pages instead of isolated fact snippets, retrieves the right context more reliably, consolidates what it learns in the background, and gives you cleaner memory surfaces for browsing and searching what it knows
  • Use your Mac from the web app: when your desktop assistant is connected, web conversations can route local actions back through your Mac for workspace files, terminal commands, browser control, and other host-side work without forcing you to switch clients
  • Live integrations now include Discord, GitHub, Google, HubSpot, Linear, Notion, Twitter, Asana, Outlook / Microsoft, and Todoist, with smoother native OAuth handoff on iOS and desktop
  • Model settings now include Fireworks and OpenRouter in the provider catalog, making it easier to configure assistants that use your preferred model provider
  • iOS got a serious polish pass: Sign in with Apple, native OAuth completion, APNs push notification groundwork, better mobile settings layouts, improved attachment preview and download behavior, file downloads through the native share sheet, and cleaner login screens
  • Everyday reliability improved across notifications, trust-rule chips, document persistence, skill loading, gateway routing, OAuth setup, memory maintenance, and macOS signing and update flows

v0.7.3

Highlights

  • Improved memory retrieval routing for more reliable and efficient background memory lookups
  • Under-the-hood stability improvements to how the assistant accesses and recalls stored information
  • General reliability fixes to the memory system to reduce retrieval inconsistencies

v0.7.2

A broad reliability release focused on the Chrome extension and cloud connection path, macOS workflow polish, contact and communication flows, heartbeat and schedule reliability, and tighter gateway security.

Highlights

  • Chrome extension and cloud reliability improved across the board: requests now send fresh session tokens consistently, platform API calls use the right headers and base URLs, self-hosted pairing is fixed, and the extension is prioritized over the macOS SSE bridge for browser control
  • Gateway and security hardening tightened local and cloud boundaries, including stricter loopback handling, known-origin checks for pairing and CORS, safer token refresh behavior, secret redaction before recall evidence reaches the model, and reduced elevated capabilities in Docker-mode assistant containers
  • Contact, messaging, and document workflows got smoother with contact prompt commands and macOS panels, gateway-owned Slack contact upserts, missed @mention catch-up after socket reconnects, stronger trusted contact checks, and PDF export for Document Writer
  • Conversation workflows are more resilient with reliable notification deep links, Mark as read in the conversation context menu, a cleaner conversation-switch loading skeleton, visible trust-rule save errors, and the new burst-based Worked for X.Ys activity model
  • Open-source visibility improved in the macOS app with a new Settings card for the public GitHub repo and a View on GitHub link in the About panel
  • Scheduling, heartbeat, and operations are more robust: scheduled tasks can retry after failures, heartbeat runs use cron-style timing with missed-run detection, SSE disconnects are detected faster with a heartbeat watchdog, and assistant gateway logs tail makes gateway debugging easier
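
The heartbeat watchdog idea can be sketched in a few lines: record when the last heartbeat arrived and treat the stream as disconnected once the silence exceeds a timeout. `HeartbeatWatchdog` and the 15-second timeout are hypothetical names and values for illustration; the notes above do not state the real interval.

```python
import time
from typing import Optional


class HeartbeatWatchdog:
    """Flags a stream as stale when no heartbeat arrives within the timeout."""

    def __init__(self, timeout_seconds: float, now: Optional[float] = None) -> None:
        self.timeout_seconds = timeout_seconds
        self.last_beat = time.monotonic() if now is None else now

    def beat(self, now: Optional[float] = None) -> None:
        # Call whenever a heartbeat (or any other event) arrives on the stream.
        self.last_beat = time.monotonic() if now is None else now

    def is_stale(self, now: Optional[float] = None) -> bool:
        now = time.monotonic() if now is None else now
        return (now - self.last_beat) > self.timeout_seconds


watchdog = HeartbeatWatchdog(timeout_seconds=15.0, now=0.0)
watchdog.beat(now=10.0)
print(watchdog.is_stale(now=20.0))  # False: 10s since the last beat
print(watchdog.is_stale(now=30.0))  # True: 20s exceeds the 15s timeout
```

Checking for silence on the client side catches half-open connections that never deliver an explicit close, which is why a watchdog can detect disconnects faster than waiting for a socket error.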

v0.7.1

A focused polish release with stronger Language Model controls, more reliable trust rule editing, Chrome extension fixes, macOS quality-of-life improvements, and credential infrastructure upgrades for smoother local and cloud assistant setups.

Highlights

  • Language Model controls got more precise: profile-specific context budgets, model-aware max token sliders, refreshed context metadata, and effective context handling across the main agent loop, wake paths, and slash commands make model profiles behave more predictably
  • Your Own mode is more reliable: explicit user saves are now respected, main agent profiles can override static defaults, profile context and slider fallbacks are preserved, and actual provider metadata is stamped correctly when routing picks a non-default provider
  • Trust rules and approvals are easier to understand: rule editor copy now shows natural language instead of structured tool data for non-bash tools, trust badge taps open the right edit flow, and approval provenance is tracked with mode, reason, and risk threshold fields
  • Chrome extension reliability improved with fixes for status probing, activity isolation, privileged tabs, popup icon paths, self-hosted UX, and hostname handling across local and cloud modes
  • macOS polish and workflow fixes: turn-end notifications now fire when the app is unfocused, task progress widgets default inline with a pop-out option, the profile editor is cleaner, toolbar state is better isolated from conversation list updates, and account deletion requests can be started from the app
  • Credential and CLI infrastructure moved forward with credential account key normalization, a direct CES credential management CLI, API key migration work, and a new vellum upgrade --latest flag for pulling the newest available version

v0.7.0

A redesigned trust and permissions system with a new v3 rules engine, Gemini 3 model support, GPT-5.5 as the new default OpenAI model, a wide range of new CLI commands, and a reworked browser extension experience.

Highlights

  • Redesigned trust and permissions system: a new v3 Trust Rules engine replaces the older permissions model, with cleaner presets (Conservative, Relaxed, Autonomous), suggested rules with an Allow & Create Rule button, directory-scoped rules, and a fully redesigned Trust Rules management UI
  • Gemini 3 model support and catalog improvements: Gemini 3 models are now available with correct pricing, thought signature capture, and tool-call metadata, the default OpenAI model is now GPT-5.5, and OpenAI reasoning effort can now be explicitly disabled or set to an extra-high tier
  • Expanded CLI: new commands for inspecting installed skills, managing trust rules, registering and listing webhooks, setting and getting environment variables, verbose exec, and SSH and exec support for managed instances
  • Reworked browser extension: a proper onboarding flow, SSE-based event transport, and direct gateway pairing replacing the legacy native messaging host, for a smoother and more reliable connection between your assistant and Chrome

v0.6.5

The Vellum browser extension is now one-click install from the Chrome Web Store, X/Twitter is generally available as a managed integration, and Slack setup is one click with expanded permissions so your assistant can triage your messages, not just post. Plus a new Voice settings tab on the web platform, a conversation Refresh action, and a wide range of macOS stability and polish fixes.

Highlights

  • Vellum browser extension on the Chrome Web Store: install the Vellum Assistant Browser Relay from the Chrome Web Store in one click, no more loading an unpacked extension in developer mode. The extension bridges your assistant to your live Chrome tabs for reading, clicking, filling, and extracting on any site you are already signed into, and now also supports cloud sign-in directly from the extension
  • X/Twitter integration is generally available: connect X to your assistant through the managed integration without toggling a flag, with expanded scopes so your assistant can read likes and bookmarks in addition to posting and browsing
  • Slack is more capable and easier to set up: creating a Slack app is now a single step with the redirect URL baked into the manifest, and the install flow has been simplified so broken links during app creation are gone. Permissions were also expanded so your assistant can now read your Slack messages for triage and summarization in addition to posting, and the bot automatically joins public channels instead of requiring an invite
  • Voice settings on the web platform: the web platform now has a dedicated Voice settings tab matching the macOS app, with push-to-talk presets, and the Test button in voice settings is always enabled so you can preview voices at any time
  • Conversation Refresh action: a new Refresh action in the conversation menu lets you reload a conversation when its state gets out of sync with the server
  • macOS stability and polish: dedicated network session for background streaming to fix stalled responses, cold-launch avatar caching, fixed scroll white space and push-to-top jitter during streaming, sidebar skeleton that waits for restoration instead of a timer, smoother home panel transitions, and a wide range of reliability improvements across memory compaction, onboarding, credentials, and OAuth

v0.6.4

Claude Opus 4.7 support, a migration to OpenAI's newer Responses API, major macOS performance fixes, a smoother chat scroll experience, configurable log retention, improved Gmail cleanup, and broad stability polish.

Highlights

  • Claude Opus 4.7 support: the newest Anthropic model is now available across the app and is automatically used when your assistant needs its strongest quality-oriented reasoning
  • OpenAI provider moved to the Responses API: upgraded the underlying connection to OpenAI models for better streaming, tool calls, and compatibility with newer features
  • Major macOS performance improvements: resolved multiple two-second-plus app hangs caused by layout, font, avatar, sound, and menu-bar initialization, plus a new inverted scroll architecture for noticeably smoother chat scrolling
  • Configurable LLM log retention: choose how long request logs are kept on your device from Settings > Permissions & Privacy. Options are 1, 7, 30, or 90 days, or never expire
  • Faster, more accurate voice transcription: Google Gemini speech-to-text now streams over the Live API for real-time partial transcripts, with support for speaker labels when the provider offers them
  • Smarter Gmail cleanup: persistent blocklist and safelist preferences, a new cold-outreach workflow with automatic classification and enrichment, and more reliable archiving behavior
  • Conversation archive: archiving and unarchiving conversations now syncs reliably with the server, with archived items sorted by when they were archived
  • Assistant thinking in progress cards: the progress card now includes a thinking sub-row and factors thinking time into the total duration, so you can see where the assistant is spending its time
  • Polish and stability fixes: cleaner thinking block layout within the chat column, Reflections grouped under Background in the sidebar, wider model names in the Usage breakdown, a fixed Web Search API key field, smoother onboarding, and a wide range of reliability improvements across memory, credentials, and OAuth

v0.6.3

Dramatically improved chat performance and stability on macOS, a redesigned onboarding flow, real assistant names throughout the UI, a refreshed integrations panel, and broad UX polish.

Highlights

  • Dramatically improved chat performance and stability on macOS: resolved a wide range of rendering hangs, layout freezes, and scroll issues, including fixes for blank chat on conversation switch, streaming scroll corrections, and smoother message rendering overall
  • Redesigned onboarding flow: the setup experience now focuses on what you want your assistant to do and how it should behave, replacing the previous personality quiz with a more practical, goal-oriented approach
  • Assistant identity improvements: assistants now display real names instead of IDs throughout the app, with random name generation for new assistants and the ability to edit role and description directly from the Identity panel
  • Refreshed integrations panel: the integrations page has been redesigned with a cleaner grid layout and moved into Settings as a dedicated tab, with provider logos for easier recognition
  • Smoother scrolling and chat polish: a new scroll model brings smoother conversation scrolling, better alignment throughout the chat, animated conversation names on hover, a new sidebar groups divider, and archived conversations sorted by archive time
  • Memory and context improvements: memory recall is now more configurable, consolidation is less aggressive to prevent memory loss, and deleted memories are now recoverable
  • Broad stability and UX fixes: simplified permission controls, skill previews now work before installation, Slack gained /new command support, and numerous fixes across modals, attachments, sidebars, and the composer

v0.6.2

Introduces a referral program for earning credits, Linear integration, cleaner thinking blocks, configurable knowledge base injection, improved API key management, a polished chat and sidebar experience, and broad stability fixes.

Highlights

  • Referral program: invite friends with your unique referral link and earn credits when they sign up. The program is accessible from the Billing tab and the sidebar, with stats tracking and program details built in
  • Linear integration: connect to Linear through the platform without configuring your own OAuth app. Linear is enabled by default during onboarding, alongside Outlook
  • Thinking blocks rendered as markdown and collapsed by default: the assistant's reasoning is now displayed as clean, formatted text that starts collapsed, making responses easier to read
  • Configurable knowledge base injection: control which knowledge base files are automatically included in conversations, with fixes for duplicate and out-of-order content
  • Improved API key management: a more reliable system for storing and reading API keys across settings and services
  • Chat and sidebar polish: a new animated typing indicator, spell checking in the composer, a Recents group replacing ungrouped conversations, and cleaner sidebar text handling
  • Stability and correctness fixes: numerous fixes across scrolling, history loading, dictation, audio handling, and OAuth flows, reducing crashes and stale data issues

v0.6.1

Introduces the Personal Knowledge Base for reliable fact recall, a refreshed design system, major macOS chat performance fixes, background agents for parallel task execution, and numerous stability improvements.

Highlights

  • Personal Knowledge Base (PKB) introduced: your assistant now has a persistent memory system that files and retrieves important information across conversations, with a fully redesigned Memory inspector showing what your assistant remembers and how confident it is
  • Refreshed design system: updated color palette, new typography, and polished components throughout the app for a cleaner, more modern look
  • Major chat performance improvements: resolved several severe slowdowns on macOS, including multi-minute freezes when switching conversations and layout bottlenecks that caused the app to hang
  • Background agents for parallel work: your assistant can now spawn background agents to handle tasks independently, with different roles for different types of work like research, coding, or planning, and the ability to report results back when finished
  • Stability and rendering fixes: smoother thinking block animations, improved markdown formatting, better sidebar behavior, and more reliable scrolling throughout the app

v0.6.0

The biggest release yet: Vellum goes open source and introduces platform-hosted assistants, a completely revamped memory system with image support, Outlook feature parity, conversation folders, and proactive assistant check-ins.

Highlights

  • Open source launch: the Vellum Assistant repository is now publicly available, inviting the community to explore, contribute, and build on the platform
  • Platform-hosted assistants: assistants can now run fully hosted on the Vellum platform, removing the need for local infrastructure and enabling seamless cloud-based operation
  • Revamped memory system with image support: the memory system has been completely rebuilt with support for image references, smarter search that combines multiple retrieval strategies, and more reliable memory consolidation
  • Outlook Calendar and Email reach full feature parity with Google: both Outlook integrations are now generally available, matching the functionality previously available only for Google Calendar and Gmail
  • Conversation folders and sidebar improvements: organize conversations into folders with automatic grouping by source, improved icons and count badges, and easy group management
  • Proactive assistant check-ins, enabled by default: your assistant can now periodically review its notes, reflect on recent conversations, and reach out when it has something worth sharing
  • Skills system redesign: skills have been rebuilt with better discovery, easier installation, and more reliable behavior across the board
  • Performance and stability improvements across macOS and iOS: faster conversation loading, smoother scrolling, and reduced memory usage through better caching and background processing

v0.5.16

Major macOS performance and stability improvements, Outlook messaging support, smarter assistant memory, security hardening, and polished UI components.

Highlights

  • Significant macOS performance and stability improvements: fixes for chat scroll freezes, sidebar lag, and app responsiveness issues, resulting in a noticeably smoother experience
  • Outlook messaging support: Vellum can now connect to Microsoft Outlook as a messaging provider, joining the existing Slack integration and expanding where your assistant can be reached
  • Smarter assistant memory: the assistant now remembers its capabilities and available tools at startup, with improved search for finding relevant memories
  • Security hardening: removed the ability to bypass permission prompts, tightened access controls across the app, and added stricter validation for sensitive operations
  • Polished UI components: redesigned skill detail page, improved file browser, better dropdowns and navigation, and a context window indicator showing how much conversation space is left

v0.5.15

Signing key handling improvements and automatic migration for smoother upgrades.

Highlights

  • Improved signing key handling to prevent potential authentication issues during normal operation
  • Automatic key migration when upgrading from older versions, ensuring a smooth upgrade experience without manual steps

v0.5.14

Thinking blocks in chat, overhauled memory and retrieval, /compact command, expanded model support, and collapsible sidebar sections.

Highlights

  • Thinking blocks are now visible in chat and enabled by default: see your assistant's reasoning process inline as collapsible blocks, giving you transparency into how responses are formed
  • Significantly improved memory and retrieval: smarter memory extraction, better search diversity to surface unexpectedly relevant memories, and improved formatting of recalled information
  • New /compact command and context window indicator: manually trigger conversation compaction at any time, with a color-coded bar in the toolbar showing how much context space remains
  • Expanded model support: DeepSeek, Qwen, Mistral, Meta, Moonshot, and Amazon models added through OpenRouter; Anthropic 1M context window beta and fast mode now supported
  • Collapsible sidebar sections and channel conversations: Scheduled and Background sidebar sections can now be collapsed, and channel-bound conversations display with a read-only indicator
