Semantic Search API

Semantic search API for jobs, built on direct-from-source data

Vector search, neural search, and natural language queries — one API, one embedding model, 4M+ live listings sourced directly from company career pages.

Indexed: Product Designer @ Airbnb
Indexed: Staff Eng @ Linear
Indexed: Head of Sales @ Ramp
Indexed: Backend Dev @ Stripe
Indexed: AI Researcher @ OpenAI
Indexed: Product Designer @ Airbnb
Indexed: Staff Eng @ Linear
Indexed: Head of Sales @ Ramp
The Endpoints

Three semantic search endpoints, one embedding model.

Pure similarity, hybrid with filters, or raw vector input — all backed by the same socrates v2 index.

Solution Illustration
Vector Search
POST /v2/jobs/search

Pure semantic similarity.

Pass a natural language query, a job ID, or your own pre-computed vectors. Get the top-K most similar listings ranked by cosine similarity.

dim_1dim_2query0.970.92EngineeringDesignData / MLtop-K results
  • NL search bars
  • similar-jobs recommendations
  • discovery feeds
  • AI agents
Neural Search
POST /v2/jobs/search

Semantic + filters, in one request.

Send a natural language query under vector and structured filters under lexical — location, salary, experience, industry, posting date. Combined ranking, single round trip.

query"ML Engineer"filterslocation: NYCsocrates-v2+ lexical filtersranked0.98ML Eng · NYC0.95AI Eng · NYC0.91DS Eng · NYC
  • production search backends
  • narrow semantic queries
  • filtered AI agents
Raw Vector Input
POST /v2/jobs/search

Bring your own embeddings.

Pass raw 768-dim vectors directly via the vectors parameter. Skip re-encoding when you've cached embeddings in your own pipeline.

vectors[0] — 768-dimd00.34d1-0.89d20.13d30.60d4-0.45d50.72d6-0.23d70.56positivenegative... 760 more dimensions✓ skip re-encodingcached pipeline
  • batch matching
  • cached embeddings
  • custom recommender stacks
The Endpoints

Many product features, one API

What the Semantic Search API actually powers in production.

A search bar that understands what users mean.

Let users type "remote backend role at a Series B fintech with equity" instead of clicking through filters. The API matches role, seniority, company stage, geography, and compensation as a single query.

Implementation: POST /v2/jobs/vsearch with search_type: "summary" and a free-form query string.

A "Similar jobs" rail that actually works.

Pass a job ID, get the top-K most similar listings. ML engineer roles cluster with Applied Researcher; data scientist clusters with quant analyst. Powers detail-page rails and "you might also like" feeds.

Implementation: POST /v2/jobs/vsearch with search_type: "job" and a job_id. Returns scores 0.0–1.0.

The data layer behind AI career agents.

Feed user intent — natural language goals, conversation context, scraped requirements — directly to the API. Match user descriptions to live listings without prompt engineering against keywords.

Implementation: POST /v2/jobs/vsearch or POST /v2/jobs/neural-search depending on whether you need filters.

Semantic queries that respect real constraints.

"ML engineer" should match Applied Researcher AND filter to remote, $150k+, senior, posted in the last 30 days. Neural Search runs both in one request with combined ranking.

Implementation: POST /v2/jobs/neural-search with both vector and lexical blocks. The recommended endpoint when filters matter.

Bring your own vectors. Reuse, batch, recommend.

Cache vectors once, query them many times. Match cached user-intent vectors against fresh job postings every hour without re-encoding. Powers batch matching, scheduled job alerts, and custom recommender stacks.

Implementation: POST /v2/jobs/neural-search with vector.vectors: [[768 floats]]. Pair with the Job Export API to mirror the index in your warehouse.
Try it live

Five-line natural language search.

Page 1 of every search endpoint returns results without an API key. Prototype before you sign up.

JSON in, JSON out.

Works with any HTTP client. Official Python SDK — pip install hirebase.

Score thresholding built in.

Every result returns a similarity score 0.0 - 1.0. Pass score to drop low-quality matches.

Background Noise
Neural Search · semantic + filters
Request
# Filtered semantic search in one request
curl -X POST https://api.hirebase.org/v2/jobs/neural-search \
-H "Content-Type: application/json" \
-H "x-api-key: YOUR_API_KEY" \
-d '{
"vector": {
"query": "senior backend engineer building
distributed systems with Go and Kafka"
},
"lexical": {
"location_types": ["Remote", "Hybrid"],
"experience": ["Senior"],
"industry": "Tech, Software & IT Services",
"days_ago": 30,
"limit": 10
}
}'
Response payload
200 OK
# Returns ranked results with similarity scores
{
"jobs": [
{
"_id": "6814bw99fc2284gt4777f21a",
"job_title": "Senior Payment Platform Engineer",
"company_name": "Klarna",
"location_type": "Remote",
"experience_level": "Senior",
"vector_score": 0.92,
"date_posted": "2026-04-15"
}, ...
],
"total_count": 847,
"total_pages": 85
}
Under The Hood

A semantic search API tuned specifically
for jobs.

Domain-tuned embeddings preserve role, seniority, function, and skill-level distinctions that general-purpose models flatten.

[ 01 ] Embedding model

socrates v2 — 768d, job-domain tuned.

Proprietary 768-dimensional model trained on tens of millions of job postings. Model name and version returned on every embedding response so you can pin to a specific release.

[ 02 ] Similarity scoring

Cosine similarity · 0.0–1.0 floats.

Vector Search returns `score`; Neural Search returns `vector_score`. Threshold low-quality results with the optional `score` request parameter.

[ 03 ] Three input modes

Query, job_id, or raw vectors.

Pass natural language (encoded server-side), the Mongo ObjectId of a reference job (uses the precomputed embedding), or your own raw 768-dimensional vector. Same endpoint, three workflows.

[ 04 ] Accuracy controls

Tune recall vs latency.

The `accuracy` parameter (`low` / `medium` / `high`) trades exhaustiveness for response time. Default is `medium`.

[ 05 ] Result count

top_k up to 500.

Use `top_k` to control candidate set size before pagination — useful for re-ranking with your own model. Pairs with `limit`, `page`, or `offset`.

[ 06 ] Free first page

Page 1 without an API key.

Both Vector Search and Neural Search return page 1 without authentication. Prototype before signing up. Page 2+ requires a free key.

Prompting Guid

How to write queries that
actually rank well.

Richer prompts give the embedding model more dimensions to match against. A well-written query consistently outperforms structured filters alone.

[ Tip 01 ]

Embed location, level, and stack directly into the prompt.

Don't relegate context to filter parameters when the embedding model can match it from the query itself.

Better:"Senior SWE based in Santa Clara, California, building Kubernetes-native infra"
[ Tip 02 ]

Describe the kind of company alongside the role.

Stage, size, and industry character match through the company entity linkage on every job.

Better:"Senior Director of Supply Chain in Pharmaceuticals, small manufacturing company with global ambition"
[ Tip 03 ]

Spell out technical background and what they want next.

For agent and copilot workflows, give the model both the candidate's current state and their target.

Better:"5+ years quantitative analysis at financial firms, expertise in PyTorch, ready to lead and mentor juniors in NYC"
[ Tip 04 ]

Use Neural Search when filters matter.

Vector Search accepts lexical parameters but does not currently honor them. For 30-day windows, salary floors, industry, or location filtering to actually narrow results, use Neural Search with both vector and lexical blocks.

Recommended:Production search backends · scheduled agents · filtered AI workflows
vs. Alternatives

Why teams choose Hirebase over generic
search APIs.

vs. building it yourself with general-purpose embeddings and a vector DB, or buying keyword-only job posting feeds.

Keyword-Only Job Posting APIs
Aggregator-derived feeds
DIY Embedding Stack
Generic embeddings + vector DB
Hirebase
Hirebase Semantic Search
Job-tuned, hosted, single API
01
Embedding model
None — keyword matching only
General-purpose (text-embedding-3, etc.)
socrates v2 — tuned on millions of job postings
02
Job data source
Mostly aggregator feeds, recycled
Bring your own — you build the crawler
Direct from 200k+ company career pages and 30+ ATS platforms
03
Setup time
API call
Weeks — crawl, encode, embed, index, host
Five minutes — page 1 free, no signup
04
Hybrid filtering
Filters work; no semantic layer
You implement re-ranking yourself
Neural Search: vector + lexical in one request
05
Domain fidelity
Term-level matching only
"ML engineer" matches "ML engineer" — flattens role distinctions
Preserves role, seniority, function, skill relationships
06
Index freshness
24–72 hour lag typical
You maintain it — re-encoding cost grows with scale
Most postings indexed in <1hr · embeddings refreshed automatically
07
Latency
Varies; usually fine
Embedding round-trip + vector DB query
Sub-50ms median — single hop
08
Pricing posture
Cheap, often unusable
Embedding fees + vector DB fees + crawler infra
Free tier · usage-based · no per-token costs
Common Questions

Frequently asked questions about the
Semantic Search API.

Get Started

Search by meaning, not keywords.

Page 1 is free. Run a real semantic search in your terminal in 60 seconds.

Share feedback - DM or email [email protected]
© 2026 HireBase. All rights reserved.