Harmstack helps teams evaluate medical AI quality with repeatable, structured benchmarks. Point Harmstack at your model endpoint, run a benchmark, and review results through theDocumentation Index
Fetch the complete documentation index at: https://docs.harmstack.ai/llms.txt
Use this file to discover all available pages before exploring further.
harmstack CLI or API.
Quick Start
Install the CLI with Homebrew or a release archive, then run your first benchmark in minutes.
CLI Reference
Explore every command and flag in the
harmstack CLI.API Authentication
Learn how to authenticate API requests with your API key.
API Endpoints
Browse the full public REST API reference.
How it works
Get your API key
Obtain your
HARMSTACK_API_KEY from Vetted Medical. This key authenticates all CLI commands and API requests.Install the CLI
Prebuilt binaries are published on GitHub:vettedmedical/harmstack-install-v0 — latest releasemacOS (Apple Silicon) and Linux (amd64): install with Homebrew or download the
.tar.gz for your platform. Windows (amd64): download the .zip and use harmstack.exe. Full steps are in Quick Start and CLI overview.Configure your model endpoint
Set your model endpoint and credentials in the wizard, or export defaults so you do not need to re-enter them each time:Harmstack supports multiple provider shapes, including
openai, openai_responses, gemini, and raw.What you can do
- Submit benchmarks - Run one-off or repeated evaluations against your model endpoint
- Inspect results - Review scores, metadata, and run details for each job
- Compare runs - Measure quality changes across model or prompt revisions
- Track trends - Monitor benchmark performance over time
- Automate in CI/CD - Run non-interactively with
--consentandskip
Admin-only API endpoints are intentionally excluded from this documentation. If you need admin access, contact Vetted Medical support.