Vellum is coming to the AI Engineering World's Fair in SF. Come visit our booth and get a live demo!

Vellum Product Update | July 2024

Learn about the latest features and improvements shipped by the Vellum team in July.

Written by
Reviewed by
No items found.

August has arrived, and with it comes a whole host of improvements to Vellum!

We've added some great new features like Prompt Variable Chips and Prompt Node Usage in Workflows. Plus, we've packed in an assortment of improvements to Workflows, Evaluations, and Deployment help with your AI development flow.

Let's take a closer look at our fav features introduced this month.

Prompt Variable Chips

Previously, you had to use {{ myVariable }} syntax to reference variables in Prompts. While the doubly-curly syntax is great for more complex Jinja templating, it can be overkill for simple variable substitution. It's harder to read, conflicts with JSON syntax, and requires manual updates when renaming variables.

To simplify this, we've introduced Variable Chips.

These are small, clickable chips you can add by typing **{{** or /. Renaming a variable updates all references automatically.

Variable Chips work in the new "Rich Text" block type. New Prompt blocks default to Rich Text, but you can convert between existing Jinja blocks and the new Rich Text block by selecting the block type dropdown in the toolbar.

Check the demo here.

Prompt Node Usage in Workflows

Previously, when running Prompts in the Workflow Sandbox, you couldn't see token counts and other usage metrics in the Prompt Node results.

Now, token counts are shown alongside a Prompt Node’s results within Workflow Sandboxes.  This setting is now on by default, but can be toggled off in the Workflow Builder Settings.

You can also now return usage data when invoking a Workflow Deployment via API, by passing in True to the expand_meta.usage parameter on either Execute Workflow endpoints.

stream = client.execute_workflow_stream(
  workflow_deployment_name="demo",
  inputs=[
    WorkflowRequestInputRequest_String(
      type="STRING",
      name="foo",
      value="bar",
    ),
  ],
  event_types=["WORKFLOW", "NODE"],
  expand_meta=WorkflowExpandMetaRequest(
    usage=True
  )
)

for event in stream:
  if event.type == "NODE" and event.data.state == "FULFILLED":
    node_result_data = event.data.data
    if node_result_data and node_result_data.type == "PROMPT":
      print(node_result_data.data.execution_meta.usage)

Metadata Filtering in Search Nodes

For a while now, you’ve been able to add structured JSON metadata to Documents and filter against it when making API calls to search a Document index (see here for more info). However, this wasn't possible via Search Nodes within the Workflow UI. You had to use a Code Node or API Node to call Vellum’s Search API manually.

Now, we’re happy to share that the UI is at parity with the API. You’ll be able to construct arbitrarily complex boolean logic using the new Metadata Filters section of the Search Node’s Advanced settings.

Other Workflows Updates

Enable/Disable All Workflow Node Mocks

Mocking Prompt Nodes helps to save token usage and iteration time when developing the later stages of your Workflow. However, once you’re happy with your Workflow, it’s often useful to run the Workflow end-to-end to make sure it all comes together.

Previously you had to enable/disable each mock individually, but this month we added support to enable/disable all mocks within a Workflow at once.

This feature simplifies the process of testing and debugging, allowing you to quickly switch between real and mocked data without issue.

Constant Values in Workflow Node Inputs

It’s often the case that you might want to specify a constant value as a Workflow Node Input, either as the input’s primary value or as its fallback value. This required cumbersome workarounds before (i.e. referencing Input Variables our the outputs of Templating Nodes).

Now, you can inline constant values directly within a Workflow Node input!

Read how to do it here.

Other Evaluations Updates

Test Suite Test Case External IDs

Previously, there was no straightforward way to sync Test Cases with external systems. Now, you can optionally assign an external ID to each Test Case upon uploading them to Vellum to make it easier to upsert changes later, keying off of that ID.

Support for Bulk Upserting Test Suite Test Cases via API

For a while now we’ve had an API for creating, replacing, and deleting Test Cases in a Test Suite in bulk. We now support a fourth operation in this API – upsert. With upsert, you can provide an external_id and a Test Case payload. If there is already a Test Case with that external_id, it’ll be replaced. Otherwise, it’ll be created.

This new operation is available in our SDKs starting v0.6.12.

Test Case CSV Upload in Evaluation Reports

Previously, if you wanted to upload Test Cases, you had to first navigate to the Test Suite itself and upload from there - making this process a lot more complex than it should be.

You can now upload Test Cases to a Test Suite directly from the Evaluations tab of a Prompt or Workflow. Just click the “Upload Test Cases” button in the header of any Evaluations table.

New Layout for Sandbox Evaluations

The previous layout for Evaluations should all Test Suites at once. This made the page cluttered, difficult to navigate, and sometimes laggy. We've updated the page layout to display one Test Suite at a time with a searchable select input.

Other Prompts Updates

Auto-Conversion to Variable Chips on Paste

You can now copy/paste variables across Prompt Blocks of different types.

If you copy text with a {{ my_var }} variable from a Jinja block and paste it into a Rich Text block, it will automatically turn into a variable chip.

Improvements to Prompt Chat History Variables

Previously, Prompts with dynamic Chat History needed an input variable specifically named $chat_history. This was understandably confusing for new customers.

Now, you can name Chat History input variables anything you want and even rename them later. We've also centralized input variable definitions, so you can create a String or Chat History variable using the "Add" button in the "Input Variables" section of the Prompt Editor.

Other Deployments Updates

Expandable Meta Params in Retrieve Provider Payload Endpoint

For a while now, we've had an API to compile a Prompt and get the exact payload Vellum sends to a model provider. Now, there's a new parameter called expand_meta. With this parameter, you can get extra metadata about the compiled prompt payload. Check our API docs to see which fields are expandable.

New “Add Document to Document Index” API

We’ve introduced a new API for adding previously uploaded Documents to a Document Index. This API is useful when you have a Document that had previously been added to one Document Index and you want to add it to another without having to re-upload its contents altogether.

It’s available in our SDKs beginning version 0.6.10. You can find docs for this new API here.

Prompt Deployment Executions Table Improvements

We've improved the Prompt Deployment Executions table by making it easier to quickly edit the “Desired Output” and “Quality” columns. This is helpful if you have a team of in-house data labelers that are providing feedback on the quality of your AI systems’ outputs.

New Models

Google Vertex AI Support

We now support Google Vertex AI models. Previously you could only use Google AI Studio for using Google’s models. You can add them to your workspace from the models page.

Llama 3.1 on Groq

Meta’s newest Llama 3.1 models are now available in Vellum through our Groq integration!

GPT-4o Mini

OpenAI’s newest GPT-4o Mini models gpt-4o-mini & gpt-4o-mini-2024-07-18 are now available in Vellum and have been added to all workspaces!

Quality of Life Improvements

Index Page Sorting

We've added a “Sort by” dropdown to sort folders and entities by created date, modified date, and label. We hope this gives you more control over how you organize and view your data.

Deployed Prompt Variant Display

When on the Prompt Deployment Overview page, you can now see the name of the Prompt Variant that’s been deployed. This is useful if your Prompt Sandbox has multiple Prompt Variants that you were comparing against one another and you’re not sure which one is currently deployed.

Copyable Text to Clipboard

We’ve introduced the ability to copy Prompt Variant IDs, Document Indexes, Models, Workflow Deployment Names and IDs, Document Keys, and Prompt Deployment Names and IDs to clipboard.

This feature comes with an enhanced UI with intuitive indicators and tooltips for copyable fields.

Index Page List View

You can now toggle how entities are displayed between two modes – Card (the default) and List view. List view can be helpful if you have many entities and want to see more of them at once.

Collapsible Index Page Sections

You can now collapse sections index pages for Prompts, Documents, Test Suites, and Workflows. Simply click the heading of any section to toggle the visibility of all folders and items within that section.

Looking ahead

We hope these updates enhance your experience and productivity. As always, we look forward to your feedback and are excited to see what you'll build next!

August will bring so many exciting features and we can’t wait to share those with you.

ABOUT THE AUTHOR
Noa Flaherty
Co-founder & CTO

Noa Flaherty, CTO and co-founder at Vellum (YC W23) is helping developers to develop, deploy and evaluate LLM-powered apps. His diverse background in mechanical and software engineering, as well as marketing and business operations gives him the technical know-how and business acumen needed to bring value to nearly any aspect of startup life. Prior to founding Vellum, Noa completed his undergrad at MIT and worked at three tech startups, including roles in MLOps at DataRobot and Product Engineering at Dover.

ABOUT THE reviewer

No items found.
lAST UPDATED
Aug 6, 2024
share post
Expert verified
Related Posts
Guides
October 21, 2025
15 min
AI transformation playbook
LLM basics
October 20, 2025
8 min
The Top Enterprise AI Automation Platforms (Guide)
LLM basics
October 10, 2025
7 min
The Best AI Workflow Builders for Automating Business Processes
LLM basics
October 7, 2025
8 min
The Complete Guide to No‑Code AI Workflow Automation Tools
All
October 6, 2025
6 min
OpenAI's Agent Builder Explained
Product Updates
October 1, 2025
7
Vellum Product Update | September
The Best AI Tips — Direct To Your Inbox

Latest AI news, tips, and techniques

Specific tips for Your AI use cases

No spam

Oops! Something went wrong while submitting the form.

Each issue is packed with valuable resources, tools, and insights that help us stay ahead in AI development. We've discovered strategies and frameworks that boosted our efficiency by 30%, making it a must-read for anyone in the field.

Marina Trajkovska
Head of Engineering

This is just a great newsletter. The content is so helpful, even when I’m busy I read them.

Jeremy Hicks
Solutions Architect

Experiment, Evaluate, Deploy, Repeat.

AI development doesn’t end once you've defined your system. Learn how Vellum helps you manage the entire AI development lifecycle.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Build AI agents in minutes with Vellum
Build agents that take on the busywork and free up hundreds of hours. No coding needed, just start creating.

General CTA component, Use {{general-cta}}

Build AI agents in minutes with Vellum
Build agents that take on the busywork and free up hundreds of hours. No coding needed, just start creating.

General CTA component  [For enterprise], Use {{general-cta-enterprise}}

The best AI agent platform for enterprises
Production-grade rigor in one platform: prompt builder, agent sandbox, and built-in evals and monitoring so your whole org can go AI native.

[Dynamic] Ebook CTA component using the Ebook CMS filtered by name of ebook.
Use {{ebook-cta}} and add a Ebook reference in the article

Thank you!
Your submission has been received!
Oops! Something went wrong while submitting the form.
Button Text

LLM leaderboard CTA component. Use {{llm-cta}}

Check our LLM leaderboard
Compare all open-source and proprietary model across different tasks like coding, math, reasoning and others.

Case study CTA component (ROI)

40% cost reduction on AI investment
Learn how Drata’s team uses Vellum and moves fast with AI initiatives, without sacrificing accuracy and security.

Case study CTA component (cutting eng overhead) = {{coursemojo-cta}}

6+ months on engineering time saved
Learn how CourseMojo uses Vellum to enable their domain experts to collaborate on AI initiatives, reaching 10x of business growth without expanding the engineering team.

Case study CTA component (Time to value) = {{time-cta}}

100x faster time to deployment for AI agents
See how RelyHealth uses Vellum to deliver hundreds of custom healthcare agents with the speed customers expect and the reliability healthcare demands.

[Dynamic] Guide CTA component using Blog Post CMS, filtering on Guides’ names

100x faster time to deployment for AI agents
See how RelyHealth uses Vellum to deliver hundreds of custom healthcare agents with the speed customers expect and the reliability healthcare demands.
New CTA
Sorts the trigger and email categories

Dynamic template box for healthcare, Use {{healthcare}}

Start with some of these healthcare examples

Population health insights reporter
Combine healthcare sources and structure data for population health management.
Prior authorization navigator
Automate the prior authorization process for medical claims.

Dynamic template box for insurance, Use {{insurance}}

Start with some of these insurance examples

Insurance claims automation agent
Collect and analyze claim information, assess risk and verify policy details.
AI agent for claims review
Review healthcare claims, detect anomalies and benchmark pricing.
Agent that summarizes lengthy reports (PDF -> Summary)
Summarize all kinds of PDFs into easily digestible summaries.

Dynamic template box for eCommerce, Use {{ecommerce}}

Start with some of these eCommerce examples

E-commerce shopping agent
Check order status, manage shopping carts and process returns.

Dynamic template box for Marketing, Use {{marketing}}

Start with some of these marketing examples

ReAct agent for web search and page scraping
Gather information from the internet and provide responses with embedded citations.
Competitor research agent
Scrape relevant case studies from competitors and extract ICP details.

Dynamic template box for Sales, Use {{sales}}

Start with some of these sales examples

Research agent for sales demos
Company research based on Linkedin and public data as a prep for sales demo.

Dynamic template box for Legal, Use {{legal}}

Start with some of these legal examples

Legal RAG chatbot
Chatbot that provides answers based on user queries and legal documents.
AI legal research agent
Comprehensive legal research memo based on research question, jurisdiction and date range.

Dynamic template box for Supply Chain/Logistics, Use {{supply}}

Start with some of these supply chain examples

Risk assessment agent for supply chain operations
Comprehensive risk assessment for suppliers based on various data inputs.

Dynamic template box for Edtech, Use {{edtech}}

Start with some of these edtech examples

Turn LinkedIn Posts into Articles and Push to Notion
Convert your best Linkedin posts into long form content.

Dynamic template box for Compliance, Use {{compliance}}

Start with some of these compliance examples

No items found.

Dynamic template box for Customer Support, Use {{customer}}

Start with some of these customer support examples

Trust Center RAG Chatbot
Read from a vector database, and instantly answer questions about your security policies.
Q&A RAG Chatbot with Cohere reranking

Template box, 2 random templates, Use {{templates}}

Start with some of these agents

Financial Statement Review Workflow
Extract and review financial statements and their corresponding footnotes from SEC 10-K filings.
Healthcare explanations of a patient-doctor match
Summarize why a patient was matched with a specific provider.

Template box, 6 random templates, Use {{templates-plus}}

Build AI agents in minutes

Insurance claims automation agent
Collect and analyze claim information, assess risk and verify policy details.
Turn LinkedIn Posts into Articles and Push to Notion
Convert your best Linkedin posts into long form content.
Clinical trial matchmaker
Match patients to relevant clinical trials based on EHR.
LinkedIn Content Planning Agent
Create a 30-day Linkedin content plan based on your goals and target audience.
Competitor research agent
Scrape relevant case studies from competitors and extract ICP details.
Retail pricing optimizer agent
Analyze product data and market conditions and recommend pricing strategies.

Build AI agents in minutes for

{{industry_name}}

Clinical trial matchmaker
Match patients to relevant clinical trials based on EHR.
Prior authorization navigator
Automate the prior authorization process for medical claims.
Population health insights reporter
Combine healthcare sources and structure data for population health management.
Legal document processing agent
Process long and complex legal documents and generate legal research memorandum.
Legal contract review AI agent
Asses legal contracts and check for required classes, asses risk and generate report.
Legal RAG chatbot
Chatbot that provides answers based on user queries and legal documents.

Case study results overview (usually added at top of case study)

What we did:

1-click

This is some text inside of a div block.

28,000+

Separate vector databases managed per tenant.

100+

Real-world eval tests run before every release.