Evaluator API (v1)

Evaluator™ API for text classification...; receive a decision with detailed policy/constraint signals.

Core

Core decision and warm APIs

Warm the model

Warms the model and its dependencies so that the first real predict call is fast. The operation is idempotent, so repeated calls are safe.

Authorizations:

subscriptionKey

Responses

Response samples

200
401
403
500

Content type

application/json

{
  "ready": true
}
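
As an illustration of the warm call, the sketch below builds a POST /v1/warm request and checks the sample 200 body. The host (`BASE_URL`) and the header name that carries the subscription key (`KEY_HEADER`) are assumptions, since this reference only names the `subscriptionKey` authorization scheme; substitute the values from your own subscription.

```python
import json
import urllib.request

# Assumptions (not stated in this reference): the host and the name of the
# header that carries the subscription key.
BASE_URL = "https://example.invalid"      # hypothetical host
KEY_HEADER = "Ocp-Apim-Subscription-Key"  # assumed header name


def build_warm_request(base_url: str, key: str) -> urllib.request.Request:
    """Build (but do not send) a POST /v1/warm request."""
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/warm",
        method="POST",
        headers={KEY_HEADER: key, "Accept": "application/json"},
    )


def is_ready(body: bytes) -> bool:
    """Return True if the response body reports {"ready": true}."""
    return json.loads(body).get("ready") is True
```

To send the request, pass it to `urllib.request.urlopen(...)` and feed the response body to `is_ready`. Because the operation is idempotent, calling it again after a failed attempt should be harmless.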

Evaluate text

Runs the Evaluator model on one or more input texts.

Latency and rate limiting:

This endpoint is designed for low-latency serving via Azure Front Door.
Under early benchmark conditions we observe end-to-end p50 ≈ 230 ms and p95 ≈ 260 ms for typical text-only requests. Actual performance may vary by region, workload, and payload.
The service is rate-limited per subscription and tier and may return HTTP 429 (Too Many Requests) when limits are exceeded.
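
One common way for a client to cope with HTTP 429 is to retry with exponential backoff. The following is a minimal sketch of that general technique, not a documented requirement of this service; the `send` callable, the attempt count, and the delays are all illustrative choices.

```python
import time
from typing import Callable, Tuple

Response = Tuple[int, bytes]  # (HTTP status, raw body)


def call_with_backoff(
    send: Callable[[], Response],
    max_attempts: int = 5,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send() until it returns a non-429 status or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    delay = base_delay
    for attempt in range(1, max_attempts + 1):
        status, body = send()
        # Return on any non-429 status, or on the final attempt regardless.
        if status != 429 or attempt == max_attempts:
            return status, body
        sleep(delay)  # wait before the next attempt
        delay *= 2    # double the wait each time: 0.5 s, 1 s, 2 s, ...
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the retry logic can be exercised without real waiting.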

Smoke testing guidance:

For a basic connectivity or smoke test, call POST /v1/warm once, then send 10–20 POST /v1/predict requests with a small test payload.
Add short delays between calls if you want to avoid triggering rate limits on a single key.
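
The smoke-test procedure above can be sketched as a small driver. The `warm` and `predict` callables are hypothetical wrappers that perform the HTTP calls and return the status code; treating 200 as the only passing status and pausing 0.2 s between calls are assumptions for illustration.

```python
import time
from typing import Callable, Dict, List


def smoke_test(
    warm: Callable[[], int],
    predict: Callable[[], int],
    requests: int = 10,
    pause_s: float = 0.2,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, object]:
    """Warm once, then send `requests` predict calls with a pause between them."""
    warm_status = warm()
    statuses: List[int] = []
    for i in range(requests):
        statuses.append(predict())
        if i < requests - 1:
            sleep(pause_s)  # short delay to stay under the per-key rate limit
    return {
        "warm_status": warm_status,
        "statuses": statuses,
        "passed": warm_status == 200 and all(s == 200 for s in statuses),
    }
```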

Vector payloads and decision steering:

The optional vectors field supports policy-based decision steering with bounded shapes (up to 10 policy dimensions and up to 28 constraint dimensions).
Requests that exceed these documented shapes or tier limits are not supported and may be rejected.
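
A client can check the documented bounds before sending a request. The sketch below assumes, hypothetically, that the vectors object holds its values under the keys `policies` and `constraints`; this reference does not spell out the exact field names, so adjust them to the actual VectorPayload schema.

```python
from typing import Dict, List, Sequence

MAX_POLICY_DIMS = 10      # documented upper bound on policy dimensions
MAX_CONSTRAINT_DIMS = 28  # documented upper bound on constraint dimensions


def shape_problems(vectors: Dict[str, Sequence[float]]) -> List[str]:
    """Return a list of shape violations; an empty list means the shape fits."""
    problems: List[str] = []
    n_policy = len(vectors.get("policies", ()))         # hypothetical key
    n_constraint = len(vectors.get("constraints", ()))  # hypothetical key
    if n_policy > MAX_POLICY_DIMS:
        problems.append(
            f"{n_policy} policy dimensions exceeds the limit of {MAX_POLICY_DIMS}"
        )
    if n_constraint > MAX_CONSTRAINT_DIMS:
        problems.append(
            f"{n_constraint} constraint dimensions exceeds the limit of {MAX_CONSTRAINT_DIMS}"
        )
    return problems
```

This only covers the global shape bounds; tier limits are enforced by the service and are not modeled here.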

Usage tiers:

Tier 1 – Decision (Text-only, Trial): send only texts[]. Returns a single decision (YES/NO/TBD) and confidence per text; vectors may be ignored for this tier.
Tier 2 – Steered Decision (6 support vectors): send texts[] plus a compact vectors object (policies/constraints and weights). Steering primarily uses the first 6 policy dimensions.
Tier 3 – Steered Decision (38 support vectors): send texts[] plus full steering vectors. Up to 10 policy and 28 constraint dimensions may be used for detailed policy/constraint signals in governance use cases.

Authorizations:

subscriptionKey

Request Body schema: application/json

texts (required)	Array of strings, non-empty. One or more input texts to evaluate.
vectors	VectorPayload (object) or object. Optional steering vectors. For Tier 2, a compact 6-vector shape is used. For Tier 3, full 38-dimension vectors can be supplied.
alpha	number <float>. Default: 2. Global steering strength. Higher values increase the influence of vectors over the base model.
v10_min	number or null <float>. Optional minimum threshold for policy (v10) activation.
e28_min	number or null <float>. Optional minimum threshold for constraint (e28) activation.
text_conf_max	number <float>. Default: 0.8. Maximum base model confidence before steering adjustments are limited.
include_probs	boolean. Default: true. If true, include per-class probability arrays in the response.
include_explain	boolean. Default: true. Reserved for future detailed explanation payloads.
max_length	integer <int32>. Default: 128. Maximum tokenized length per input text.
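
As an illustration of the schema above, this sketch assembles a request body and rejects obviously invalid input before it reaches the service. The helper name `build_predict_body` is hypothetical. It deliberately omits any option the caller does not set, on the assumption that the service then applies the defaults listed above.

```python
from typing import Any, Dict, List

# Optional top-level fields listed in the request body schema.
OPTIONAL_FIELDS = {
    "vectors", "alpha", "v10_min", "e28_min", "text_conf_max",
    "include_probs", "include_explain", "max_length",
}


def build_predict_body(texts: List[str], **options: Any) -> Dict[str, Any]:
    """Return a JSON-serializable body for the evaluate-text request."""
    if isinstance(texts, str):
        raise TypeError("texts must be a list of strings, not a single string")
    if not texts:
        raise ValueError("texts must contain at least one string")
    if not all(isinstance(t, str) for t in texts):
        raise TypeError("every entry in texts must be a string")
    unknown = set(options) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    body: Dict[str, Any] = {"texts": list(texts)}
    body.update(options)  # only explicitly set options are sent
    return body
```

For example, `build_predict_body(["The World is Beautiful"], alpha=1.5)` yields a body with just `texts` and `alpha`, which can be serialized with `json.dumps`.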

Responses

Request samples

Content type

application/json

{
  "texts": ["The World is Beautiful"]
}

Response samples

Content type

application/json

{
  "decision": [
    {
      "text": "The World is Beautiful",
      "label": "YES",
      "confidence": 1
    }
  ]
}
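
A response shaped like the sample above could be parsed as follows. This is a minimal sketch that reads only the three fields shown in the sample (`text`, `label`, `confidence`) and ignores anything else; the `Decision` type and `parse_decisions` helper are illustrative names, not part of the API.

```python
import json
from typing import List, NamedTuple

# Decision values named in the tier descriptions.
VALID_LABELS = {"YES", "NO", "TBD"}


class Decision(NamedTuple):
    text: str
    label: str
    confidence: float


def parse_decisions(body: str) -> List[Decision]:
    """Parse the JSON response body into a list of Decision records."""
    payload = json.loads(body)
    decisions: List[Decision] = []
    for item in payload["decision"]:
        if item["label"] not in VALID_LABELS:
            raise ValueError(f"unexpected label: {item['label']!r}")
        decisions.append(
            Decision(item["text"], item["label"], float(item["confidence"]))
        )
    return decisions
```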

Health

Private health check

Responses