Probefish v1.0.0 "Marlin": Production-Ready LLM Testing

Name: Probefish
Author: Probefish

We're excited to announce Probefish v1.0.0, codenamed Marlin. This release marks our transition to a production-ready platform with SaaS offerings while keeping self-hosted deployments free forever.

What's New

SaaS Launch with Self-Hosted Freedom

Probefish is now available as a managed cloud service with flexible plans. But here's the important part: self-hosted remains free forever. No feature gates, no artificial limitations. Deploy on your own infrastructure and get the full platform.

Why both options?

Cloud: Zero setup, automatic updates, we handle the infrastructure
Self-hosted: Full control, data stays on your servers, no ongoing costs

Per-Test-Case Validation Rules

Previously, validation rules applied to an entire test suite. Now you can set different validation rules for each test case.

Why it matters:

Different inputs need different validation. A test case for "Write a haiku" needs length constraints. A test case for "Explain quantum computing" needs keyword checks. Now each can have its own rules.

Example:

Test Case	Validation Rules
Generate haiku	Max length: 100, Contains line breaks
Explain REST APIs	Contains "HTTP", Contains "endpoint"
Translate to French	Regex: French characters present

No more one-size-fits-all validation.

Real-Time Streaming Results (SSE)

Waiting for all tests to complete before seeing results? Not anymore.

Probefish now uses Server-Sent Events (SSE) to stream results as each test case completes. Watch your test run progress in real-time:

Test 1/10: customer_greeting .......... PASSED (1.2s)
Test 2/10: product_query .............. PASSED (0.8s)
Test 3/10: refund_request ............. FAILED (validation)
Test 4/10: ... [running]

Benefits:

See failures immediately - don't wait for the full suite
Monitor long-running test suites without timeouts
Heartbeat keeps connections alive for extended runs

Smart JSON Detection

The "Edit Test Case" modal now automatically detects JSON in input fields and formats it accordingly.

Paste this:

{"user_id": "123", "query": "help"}

The editor recognizes it's JSON and:

Applies syntax highlighting
Validates JSON structure
Preserves formatting on save

No more switching between "text" and "JSON" modes manually.

URL-Based Tab Persistence

Ever refreshed the page and lost your place? Fixed.

Tab selections in test suites and projects now persist in the URL hash:

/projects/my-project/test-suites/auth-tests#results

Benefits:

Share links to specific tabs with teammates
Browser back/forward works naturally
Refresh doesn't reset your view

Duplicate Validation Rules

Found a validation rule you want to reuse? Click duplicate instead of recreating it from scratch.

Works for:

Contains / Excludes rules
Length constraints
Regex patterns
Response time limits

Small feature, big time saver when building comprehensive test suites.

Full SMTP Support + Magic Links

Email infrastructure got a complete overhaul:

SMTP Support: Connect any SMTP server - Gmail, SendGrid, AWS SES, your corporate mail server. No vendor lock-in.

Magic Links: Passwordless authentication is now fully supported. Users click a link in their email to sign in - no password required.

Configuration:

SMTP_HOST=smtp.example.com
SMTP_PORT=587
SMTP_USER=your-user
SMTP_PASS=your-password
SMTP_FROM=noreply@example.com

We removed the Resend dependency - you control your email delivery.

Bug Fixes

Test case checkbox fix: Previously, newly added test cases didn't show selection checkboxes until the suite was saved and the page reloaded. Now they appear immediately.
Various UI polish and minor improvements

What's Next

Version 1.1.0 is already in development with:

Multi-turn conversation testing - Test chat flows, not just single prompts
Human-readable identifiers - Use slugs instead of MongoDB IDs in CLI
UI skeleton loaders - Better perceived performance

Thank You

Marlin represents month of work toward a production-ready platform. Whether you're using Probefish Cloud or self-hosting, we're committed to building the best LLM testing tool available.

Questions? Issues? Open a GitHub issue or reach out directly.

Happy testing!