Vellum Product Update | July 2024

Learn about the latest features and improvements shipped by the Vellum team in July.

Author

Noa Flaherty

Aug 6, 2024

August has arrived, and with it comes a whole host of improvements to Vellum!

We've added some great new features like Prompt Variable Chips and Prompt Node Usage in Workflows. Plus, we've packed in an assortment of improvements to Workflows, Evaluations, and Deployments to help with your AI development flow.

Let's take a closer look at our fav features introduced this month.

Prompt Variable Chips

Previously, you had to use {{ myVariable }} syntax to reference variables in Prompts. While the double-curly syntax is great for more complex Jinja templating, it can be overkill for simple variable substitution. It's harder to read, conflicts with JSON syntax, and requires manual updates when renaming variables.

To simplify this, we've introduced Variable Chips.

These are small, clickable chips you can add by typing {{ or /. Renaming a variable updates all references automatically.

Variable Chips work in the new "Rich Text" block type. New Prompt blocks default to Rich Text, but you can convert between existing Jinja blocks and the new Rich Text block by selecting the block type dropdown in the toolbar.

Check the demo here.

Prompt Node Usage in Workflows

Previously, when running Prompts in the Workflow Sandbox, you couldn't see token counts and other usage metrics in the Prompt Node results.

Now, token counts are shown alongside a Prompt Node’s results within Workflow Sandboxes. This setting is now on by default, but can be toggled off in the Workflow Builder Settings.

You can also now return usage data when invoking a Workflow Deployment via API by passing True to the expand_meta.usage parameter on either of the Execute Workflow endpoints.

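# Import paths are assumed from the Vellum Python SDK (v0.6.x); adjust for your installed version.
from vellum import WorkflowExpandMetaRequest, WorkflowRequestInputRequest_String
from vellum.client import Vellum

client = Vellum(api_key="<YOUR_API_KEY>")

# Stream Workflow and Node events, asking for usage data to be included.
stream = client.execute_workflow_stream(
  workflow_deployment_name="demo",
  inputs=[
    WorkflowRequestInputRequest_String(
      type="STRING",
      name="foo",
      value="bar",
    ),
  ],
  event_types=["WORKFLOW", "NODE"],
  expand_meta=WorkflowExpandMetaRequest(
    usage=True
  )
)

# Print usage for each Prompt Node that finished successfully.
for event in stream:
  if event.type == "NODE" and event.data.state == "FULFILLED":
    node_result_data = event.data.data
    if node_result_data and node_result_data.type == "PROMPT":
      print(node_result_data.data.execution_meta.usage)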
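
If you want a running total rather than per-node prints, the usage objects can be summed as they arrive. Here is a minimal sketch of that idea. The Usage class and its input_token_count / output_token_count fields are stand-ins assumed for illustration, so check the usage object your SDK version actually returns.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class Usage:
    # Hypothetical stand-in for the usage object returned per Prompt Node.
    # Field names are assumptions, not confirmed by this post.
    input_token_count: Optional[int] = None
    output_token_count: Optional[int] = None


def total_tokens(usages: Iterable[Optional[Usage]]) -> Dict[str, int]:
    """Sum token counts, skipping nodes that reported no usage."""
    totals = {"input": 0, "output": 0}
    for usage in usages:
        if usage is None:
            continue
        # Treat a missing count as zero so partial usage data still adds up.
        totals["input"] += usage.input_token_count or 0
        totals["output"] += usage.output_token_count or 0
    return totals
```

In the loop above, you would collect each node's usage into a list and pass it to total_tokens once the stream ends.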

Metadata Filtering in Search Nodes

For a while now, you’ve been able to add structured JSON metadata to Documents and filter against it when making API calls to search a Document index (see here for more info). However, this wasn't possible via Search Nodes within the Workflow UI. You had to use a Code Node or API Node to call Vellum’s Search API manually.

Now, we’re happy to share that the UI is at parity with the API. You can construct arbitrarily complex boolean logic using the new Metadata Filters section of the Search Node’s Advanced settings.
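
To make "arbitrarily complex boolean logic" concrete, here is a small sketch of how nested AND / OR / NOT conditions can be evaluated against a Document's metadata. The filter shape (type, conditions, field, value) is hypothetical and invented for this example. It is not Vellum's filter schema.

```python
from typing import Any, Dict


def matches(metadata: Dict[str, Any], node: Dict[str, Any]) -> bool:
    """Recursively evaluate a nested boolean filter against metadata."""
    kind = node["type"]
    if kind == "AND":
        return all(matches(metadata, child) for child in node["conditions"])
    if kind == "OR":
        return any(matches(metadata, child) for child in node["conditions"])
    if kind == "NOT":
        return not matches(metadata, node["condition"])
    if kind == "EQ":
        return metadata.get(node["field"]) == node["value"]
    if kind == "IN":
        return metadata.get(node["field"]) in node["values"]
    raise ValueError(f"Unknown filter type: {kind}")


# department == "legal" AND (year == 2024 OR status NOT IN [draft, archived])
example_filter = {
    "type": "AND",
    "conditions": [
        {"type": "EQ", "field": "department", "value": "legal"},
        {
            "type": "OR",
            "conditions": [
                {"type": "EQ", "field": "year", "value": 2024},
                {
                    "type": "NOT",
                    "condition": {
                        "type": "IN",
                        "field": "status",
                        "values": ["draft", "archived"],
                    },
                },
            ],
        },
    ],
}
```

Because each group can contain further groups, the same recursive structure covers any depth of nesting.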

Other Workflows Updates

Enable/Disable All Workflow Node Mocks

Mocking Prompt Nodes helps to save token usage and iteration time when developing the later stages of your Workflow. However, once you’re happy with your Workflow, it’s often useful to run the Workflow end-to-end to make sure it all comes together.

Previously you had to enable/disable each mock individually, but this month we added support to enable/disable all mocks within a Workflow at once.

This feature simplifies the process of testing and debugging, allowing you to quickly switch between real and mocked data without issue.

Constant Values in Workflow Node Inputs

You'll often want to specify a constant value as a Workflow Node Input, either as the input’s primary value or as its fallback value. Previously, this required cumbersome workarounds (i.e. referencing Input Variables or the outputs of Templating Nodes).

Now, you can inline constant values directly within a Workflow Node input!

Read how to do it here.

Other Evaluations Updates

Test Suite Test Case External IDs

Previously, there was no straightforward way to sync Test Cases with external systems. Now, you can optionally assign an external ID to each Test Case when uploading it to Vellum, which makes it easier to upsert changes later by keying off of that ID.

Support for Bulk Upserting Test Suite Test Cases via API

For a while now we’ve had an API for creating, replacing, and deleting Test Cases in a Test Suite in bulk. We now support a fourth operation in this API – upsert. With upsert, you can provide an external_id and a Test Case payload. If there is already a Test Case with that external_id, it’ll be replaced. Otherwise, it’ll be created.
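
The upsert rule described above (replace when the external_id already exists, otherwise create) can be sketched with an in-memory store. The bulk_upsert function and the dictionary-based store are illustrative assumptions. They show the semantics only and are not the Vellum API.

```python
from typing import Dict, List


def bulk_upsert(existing: Dict[str, dict], incoming: List[dict]) -> Dict[str, List[str]]:
    """Apply upsert semantics keyed on external_id, mutating `existing`."""
    created: List[str] = []
    replaced: List[str] = []
    for case in incoming:
        external_id = case["external_id"]
        if external_id in existing:
            replaced.append(external_id)
        else:
            created.append(external_id)
        # In both branches the stored Test Case becomes the incoming payload.
        existing[external_id] = case
    return {"created": created, "replaced": replaced}
```

Running the same payload twice leaves the store unchanged the second time, which is what makes upsert convenient for syncing from an external system.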

This new operation is available in our SDKs starting v0.6.12.

Test Case CSV Upload in Evaluation Reports

Previously, if you wanted to upload Test Cases, you had to first navigate to the Test Suite itself and upload from there, which made the process more complex than it needed to be.

You can now upload Test Cases to a Test Suite directly from the Evaluations tab of a Prompt or Workflow. Just click the “Upload Test Cases” button in the header of any Evaluations table.

New Layout for Sandbox Evaluations

The previous layout for Evaluations showed all Test Suites at once. This made the page cluttered, difficult to navigate, and sometimes laggy. We've updated the page layout to display one Test Suite at a time with a searchable select input.

Other Prompts Updates

Auto-Conversion to Variable Chips on Paste

You can now copy/paste variables across Prompt Blocks of different types.

If you copy text with a {{ my_var }} variable from a Jinja block and paste it into a Rich Text block, it will automatically turn into a variable chip.
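
As a rough illustration of what this conversion involves, the sketch below splits pasted text into plain strings and chip placeholders. The regex, the to_segments helper, and the {"chip": ...} representation are all assumptions made for this example, not Vellum's implementation. Only bare variable names are converted, and more complex Jinja expressions are left as text.

```python
import re
from typing import List, Union

# Matches only a bare identifier between double curly braces, e.g. {{ my_var }}.
SIMPLE_VAR = re.compile(r"\{\{\s*([A-Za-z_][A-Za-z0-9_]*)\s*\}\}")


def to_segments(text: str) -> List[Union[str, dict]]:
    """Split text into plain strings and {"chip": name} placeholders."""
    segments: List[Union[str, dict]] = []
    cursor = 0
    for match in SIMPLE_VAR.finditer(text):
        if match.start() > cursor:
            # Keep the literal text that precedes this variable reference.
            segments.append(text[cursor:match.start()])
        segments.append({"chip": match.group(1)})
        cursor = match.end()
    if cursor < len(text):
        segments.append(text[cursor:])
    return segments
```

For example, to_segments("Hello {{ my_var }}!") yields the text "Hello ", a chip for my_var, and the text "!", while an expression such as {{ name | upper }} stays as plain text.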

Improvements to Prompt Chat History Variables

Previously, Prompts with dynamic Chat History needed an input variable specifically named $chat_history. This was understandably confusing for new customers.

Now, you can name Chat History input variables anything you want and even rename them later. We've also centralized input variable definitions, so you can create a String or Chat History variable using the "Add" button in the "Input Variables" section of the Prompt Editor.

Other Deployments Updates

Expandable Meta Params in Retrieve Provider Payload Endpoint

For a while now, we've had an API to compile a Prompt and get the exact payload Vellum sends to a model provider. Now, there's a new parameter called expand_meta. With this parameter, you can get extra metadata about the compiled prompt payload. Check our API docs to see which fields are expandable.

New “Add Document to Document Index” API

We’ve introduced a new API for adding previously uploaded Documents to a Document Index. This API is useful when you have a Document that had previously been added to one Document Index and you want to add it to another without having to re-upload its contents altogether.

It’s available in our SDKs beginning version 0.6.10. You can find docs for this new API here.

Prompt Deployment Executions Table Improvements

We've improved the Prompt Deployment Executions table by making it easier to quickly edit the “Desired Output” and “Quality” columns. This is helpful if you have a team of in-house data labelers that are providing feedback on the quality of your AI systems’ outputs.

New Models

Google Vertex AI Support

We now support Google Vertex AI models. Previously, Google AI Studio was the only way to use Google’s models in Vellum. You can add Vertex AI models to your workspace from the models page.

Llama 3.1 on Groq

Meta’s newest Llama 3.1 models are now available in Vellum through our Groq integration!

GPT-4o Mini

OpenAI’s newest GPT-4o Mini models, gpt-4o-mini and gpt-4o-mini-2024-07-18, are now available in Vellum and have been added to all workspaces!

Quality of Life Improvements

Index Page Sorting

We've added a “Sort by” dropdown to sort folders and entities by created date, modified date, and label. We hope this gives you more control over how you organize and view your data.

Deployed Prompt Variant Display

When on the Prompt Deployment Overview page, you can now see the name of the Prompt Variant that’s been deployed. This is useful if your Prompt Sandbox has multiple Prompt Variants that you were comparing against one another and you’re not sure which one is currently deployed.

Copyable Text to Clipboard

We’ve introduced the ability to copy Prompt Variant IDs, Document Indexes, Models, Workflow Deployment Names and IDs, Document Keys, and Prompt Deployment Names and IDs to clipboard.

This feature comes with an enhanced UI with intuitive indicators and tooltips for copyable fields.

Index Page List View

You can now toggle how entities are displayed between two modes – Card (the default) and List view. List view can be helpful if you have many entities and want to see more of them at once.

Collapsible Index Page Sections

You can now collapse sections on index pages for Prompts, Documents, Test Suites, and Workflows. Simply click the heading of any section to toggle the visibility of all folders and items within that section.

Looking ahead

We hope these updates enhance your experience and productivity. As always, we look forward to your feedback and are excited to see what you'll build next!

August will bring so many exciting features and we can’t wait to share those with you.

ABOUT THE AUTHOR

Noa Flaherty

Co-founder & CTO

Noa Flaherty, CTO and co-founder at Vellum (YC W23) is helping developers to develop, deploy and evaluate LLM-powered apps. His diverse background in mechanical and software engineering, as well as marketing and business operations gives him the technical know-how and business acumen needed to bring value to nearly any aspect of startup life. Prior to founding Vellum, Noa completed his undergrad at MIT and worked at three tech startups, including roles in MLOps at DataRobot and Product Engineering at Dover.
