Datasets · Active & Historical

Job posting datasets, indexed at the source

Active and historical job posting data covering 200,000+ employers since 2023. Delivered as a one-time export or an ongoing feed via S3 or API.

200K+ company sources
40M+ jobs in database
Coverage since 2023
50+ fields per posting
Indexed: Product Designer @ Airbnb
Indexed: Staff Eng @ Linear
Indexed: Head of Sales @ Ramp
Indexed: Backend Dev @ Stripe
Indexed: AI Researcher @ OpenAI
Use Case

Every record, fully resolved.

Each posting is geocoded, AI-enriched, and linked to a full company entity. One pull gives you the listing, the location, the company, and the meaning.

Core

Title, description, dates

Raw HTML and normalized text. Application URL. Posted and expiration dates.

Geocoded

Resolved location

City, region, country, latitude, longitude, bounding box. Radius-searchable.
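Because every posting carries a latitude and longitude, a radius search can also be run client-side on an export. The following is a minimal sketch, assuming records shaped like the sample record's `location` object; the `within_radius` helper and the three toy records are hypothetical and are not part of any HireBase API.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius

def haversine_km(lat1, lng1, lat2, lng2):
    """Great-circle distance between two points, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lng2 - lng1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def within_radius(records, lat, lng, radius_km):
    """Keep records whose location lies within radius_km of (lat, lng)."""
    return [
        r for r in records
        if haversine_km(lat, lng, r["location"]["lat"], r["location"]["lng"]) <= radius_km
    ]

# Hypothetical records; only the location fields matter here.
records = [
    {"job_id": "a", "location": {"lat": 37.7749, "lng": -122.4194}},  # San Francisco
    {"job_id": "b", "location": {"lat": 37.8044, "lng": -122.2712}},  # Oakland
    {"job_id": "c", "location": {"lat": 40.7128, "lng": -74.0060}},   # New York
]

nearby = within_radius(records, 37.7749, -122.4194, radius_km=25)
print([r["job_id"] for r in nearby])
```

For large exports, a cheap pre-filter on the bounding box before computing exact distances would likely be faster.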

AI-extracted

Structured intelligence

Skills, technologies, benefits, salary, education, experience, visa sponsorship.

Linked entity

Company graph

Industry, sub-industry, employee size, LinkedIn, ATS platform, tech stack.

Vector

768-dim embedding

Semantic vector per posting. Drop-in for similarity search and matching.
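As an illustration of similarity search over the per-posting vectors, here is a small sketch using cosine similarity. It assumes cosine is an appropriate metric for these embeddings, which the page does not state, and it uses 4-dimensional toy vectors in place of the real 768-dimensional ones. The `top_matches` helper and the job IDs are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)

def top_matches(query, postings, k=2):
    """Return the k (score, job_id) pairs most similar to the query vector."""
    scored = [(cosine_similarity(query, p["embedding"]), p["job_id"]) for p in postings]
    return sorted(scored, reverse=True)[:k]

# Toy 4-dimensional stand-ins for the real embeddings.
postings = [
    {"job_id": "x", "embedding": [1.0, 0.0, 0.0, 0.0]},
    {"job_id": "y", "embedding": [0.9, 0.1, 0.0, 0.0]},
    {"job_id": "z", "embedding": [0.0, 1.0, 0.0, 0.0]},
]

top = top_matches([1.0, 0.05, 0.0, 0.0], postings, k=2)
print([job_id for _, job_id in top])
```

At dataset scale, a vector index would normally replace this linear scan; the scoring logic stays the same.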

sample_record.json
{
  "job_id": "hb_a8f3c1ed72b4",
  "title": "Senior Backend Engineer",
  "company": {
    "name": "Helix Systems",
    "slug": "helix-systems",
    "industry": "Enterprise Software",
    "ats": "Greenhouse",
    "employee_size_range": "501-1000"
  },
  "location": {
    "city": "San Francisco",
    "region": "California",
    "country": "United States",
    "lat": 37.7749,
    "lng": -122.4194
  },
  "posted_date": "2024-09-12T14:21:00Z",
  "expired_date": null,
  "skills": ["Python", "Distributed Systems", "PostgreSQL"],
  "technologies": ["Kubernetes", "AWS", "Terraform"],
  "salary_range": {
    "min": 220000,
    "max": 405000,
    "currency": "USD"
  },
  "experience_years": { "min": 5, "max": null },
  "visa_sponsorship": true,
  "embedding": [0.0124, -0.0319, /* 766 more */]
}
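To show how a record like this might be consumed, the sketch below parses a trimmed copy of the sample (the embedding is omitted, since the inline comment in the sample is not valid JSON) and derives a few convenience fields. The `summarize` helper is hypothetical, and reading a null `expired_date` as "still active" is an assumption rather than documented behavior.

```python
import json

# Trimmed copy of the sample record, without the embedding.
raw = """
{
  "job_id": "hb_a8f3c1ed72b4",
  "title": "Senior Backend Engineer",
  "company": {"name": "Helix Systems", "ats": "Greenhouse"},
  "location": {"city": "San Francisco", "region": "California"},
  "posted_date": "2024-09-12T14:21:00Z",
  "expired_date": null,
  "salary_range": {"min": 220000, "max": 405000, "currency": "USD"},
  "experience_years": {"min": 5, "max": null}
}
"""

def summarize(record):
    """Flatten a posting into a few commonly used fields."""
    salary = record.get("salary_range") or {}
    low, high = salary.get("min"), salary.get("max")
    midpoint = (low + high) / 2 if low is not None and high is not None else None
    return {
        "job_id": record["job_id"],
        "headline": f'{record["title"]} @ {record["company"]["name"]}',
        "city": record["location"]["city"],
        # Assumption: a null expired_date means the posting is still live.
        "is_active": record.get("expired_date") is None,
        "salary_midpoint": midpoint,
    }

summary = summarize(json.loads(raw))
print(summary)
```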
Use Cases

Built for the teams measuring the labor market.

Quants & Alt-data

Hiring as a leading indicator.

Historical posting volume by company, sector, and geography. Track headcount intent before it shows up in 10-Ks. Pre-cleaned, point-in-time accurate, structured for backtesting.
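One way to use the data for backtesting is to reconstruct which postings were live on a given date from `posted_date` and `expired_date`. The sketch below assumes those two fields behave as in the sample record, with a null `expired_date` meaning the posting has not yet expired. The helper names, the records, and the company "Orbit Labs" are hypothetical.

```python
from datetime import datetime

def parse_ts(value):
    """Parse an ISO 8601 timestamp with a trailing Z into an aware datetime."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))

def active_as_of(records, as_of):
    """Postings that were live at `as_of`: posted on or before it, not yet expired."""
    live = []
    for r in records:
        posted = parse_ts(r["posted_date"])
        expired = parse_ts(r["expired_date"]) if r["expired_date"] else None
        if posted <= as_of and (expired is None or expired > as_of):
            live.append(r)
    return live

def active_count_by_company(records, as_of):
    """Number of live postings per company name at `as_of`."""
    counts = {}
    for r in active_as_of(records, as_of):
        name = r["company"]["name"]
        counts[name] = counts.get(name, 0) + 1
    return counts

# Hypothetical records for illustration.
records = [
    {"company": {"name": "Helix Systems"}, "posted_date": "2024-01-10T00:00:00Z",
     "expired_date": "2024-03-01T00:00:00Z"},
    {"company": {"name": "Helix Systems"}, "posted_date": "2024-02-15T00:00:00Z",
     "expired_date": None},
    {"company": {"name": "Orbit Labs"}, "posted_date": "2024-04-01T00:00:00Z",
     "expired_date": None},
]

feb = active_count_by_company(records, parse_ts("2024-02-20T00:00:00Z"))
apr = active_count_by_company(records, parse_ts("2024-04-02T00:00:00Z"))
print(feb, apr)
```

Filtering on `as_of` in this way keeps later postings and expirations out of earlier snapshots, which is what avoids look-ahead bias in a backtest.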

Market researchers

Quantify labor demand.

Salary curves by role and metro. Skill demand inflection points. Geographic shifts in white-collar hiring. Source data unaggregated and unpolluted by recruiter spam.
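A salary cut by role and metro could be computed from raw records roughly as follows. This is a sketch under stated assumptions: it groups on the exact `title` string and `location.city`, uses the midpoint of `salary_range`, skips postings without a complete range, and treats all amounts as the same currency. The function name and records are hypothetical.

```python
from collections import defaultdict
from statistics import median

def median_midpoint_by_role_and_city(records):
    """Median salary-range midpoint per (title, city) pair."""
    buckets = defaultdict(list)
    for r in records:
        salary = r.get("salary_range")
        # Skip postings with no salary or an open-ended range.
        if not salary or salary.get("min") is None or salary.get("max") is None:
            continue
        key = (r["title"], r["location"]["city"])
        buckets[key].append((salary["min"] + salary["max"]) / 2)
    return {key: median(values) for key, values in buckets.items()}

# Hypothetical records for illustration.
records = [
    {"title": "Data Engineer", "location": {"city": "Austin"},
     "salary_range": {"min": 120000, "max": 160000}},
    {"title": "Data Engineer", "location": {"city": "Austin"},
     "salary_range": {"min": 130000, "max": 190000}},
    {"title": "Data Engineer", "location": {"city": "Austin"},
     "salary_range": {"min": 150000, "max": 210000}},
    {"title": "Data Engineer", "location": {"city": "Denver"},
     "salary_range": None},
    {"title": "Data Engineer", "location": {"city": "Denver"},
     "salary_range": {"min": 110000, "max": 150000}},
]

medians = median_midpoint_by_role_and_city(records)
print(medians)
```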

Builders & Developers

Power "what to apply to next." Train models on real data.

Bootstrap a job board, an AI agent, or a matching engine without standing up a crawl stack. 200K+ employers, structured fields, embeddings included.

Sales & GTM

Buying signals at scale.

Companies that hired their first DevOps engineer in Q3 2024. Companies that grew engineering 4× in twelve months. One-time pull, exact criteria, ready for CRM ingestion.
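Criteria like these translate into simple filters over posting history. The sketch below approximates "hired their first DevOps engineer in Q3 2024" as "the earliest posting with 'devops' in the title falls in Q3 2024". That substitutes postings for hires and a substring match for real role classification, so it is an assumption rather than the method behind any HireBase extract. All records and company names other than Helix Systems are hypothetical.

```python
from datetime import datetime

Q3_START = datetime.fromisoformat("2024-07-01T00:00:00+00:00")
Q4_START = datetime.fromisoformat("2024-10-01T00:00:00+00:00")

def first_devops_posting_in_q3(records):
    """Companies whose earliest DevOps-titled posting landed in Q3 2024."""
    first_seen = {}
    for r in records:
        if "devops" not in r["title"].lower():
            continue
        posted = datetime.fromisoformat(r["posted_date"].replace("Z", "+00:00"))
        name = r["company"]["name"]
        if name not in first_seen or posted < first_seen[name]:
            first_seen[name] = posted
    return sorted(n for n, ts in first_seen.items() if Q3_START <= ts < Q4_START)

# Hypothetical records for illustration.
records = [
    {"title": "DevOps Engineer", "company": {"name": "Helix Systems"},
     "posted_date": "2024-08-05T09:00:00Z"},
    {"title": "Senior DevOps Engineer", "company": {"name": "Orbit Labs"},
     "posted_date": "2023-11-20T09:00:00Z"},
    {"title": "DevOps Engineer", "company": {"name": "Orbit Labs"},
     "posted_date": "2024-07-15T09:00:00Z"},
    {"title": "Backend Engineer", "company": {"name": "Nimbus Co"},
     "posted_date": "2024-08-01T09:00:00Z"},
    {"title": "DevOps Engineer", "company": {"name": "Quarry AI"},
     "posted_date": "2024-10-01T00:00:00Z"},
]

matches = first_devops_posting_in_q3(records)
print(matches)
```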

In the wild

Research and reports built on HireBase data.

A selection of investigations our team has run on the dataset. The kind of analysis our customers run too.

HireBase Research · [Month YYYY]

The Unsexy Salary Premium

The highest-paying technical skills nobody is talking about. We mapped where COBOL maintainers, niche ERP specialists, and specific hardware experts quietly out-earn the AI-hyped roles dominating the discourse.

Read report
HireBase Research · [Month YYYY]

The Experience Inflation Index

Are "entry-level" jobs really demanding 3 to 5 years of experience? We measured the title-to-experience gap across industries, companies, and tech stacks to find the worst offenders.

Read report
HireBase Research · [Month YYYY]

The RTO Reality Check

How much of the return-to-office mandate is actually showing up in postings? We tracked remote, hybrid, and on-site rates across company size and industry to see whether the headlines match the data.

Read report
HireBase Research · [Month YYYY]

The Revolving Door Index

Identifying companies stuck in toxic churn cycles: the ones reposting the exact same role week after week instead of actually growing the team.

Read report
Or explore live data

Real-time hiring trends, salary benchmarks, and demand signals.

80+ roles tracked across 50+ locations, updated daily. Drill into trending roles, hottest markets, and salary leaders without a contract.

Popular cuts

Pre-defined extracts, or bring your own criteria.

By geography
United States
Canada
United Kingdom
Australia
New Zealand
Europe
By industry
Software Engineering
Data Science
Machine Learning
AI
Data Engineering
DevOps
By role family
Software Engineer
Data Scientist
Machine Learning Engineer
AI Engineer
Data Engineer
DevOps Engineer
Pricing

Transparent, volume-tiered.

Active job data is available for self-serve export through the web app at the rates below. Historical and expired listings are priced separately based on volume, time range, and use case.

Active job data

Volume               Rate per job
First 1,000 jobs     $0.10
Next 10,000 jobs     $0.04
Next 100,000 jobs    $0.01
100,001+ jobs        $0.005
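To estimate an export bill from these rates, the sketch below treats the tiers as graduated, so each rate applies only to the jobs that fall inside its band. That reading is an assumption. So is the start of the last tier: the "First / Next / Next" wording implies the final rate begins after 111,000 jobs, while the page labels it "100,001+". The first-10-jobs-free offer is not modeled, and `export_cost` is a hypothetical helper.

```python
from decimal import Decimal

# (band size, rate per job); None marks the open-ended final band.
TIERS = [
    (1_000, Decimal("0.10")),
    (10_000, Decimal("0.04")),
    (100_000, Decimal("0.01")),
    (None, Decimal("0.005")),
]

def export_cost(job_count):
    """Estimated cost in USD for exporting job_count active jobs."""
    if job_count < 0:
        raise ValueError("job_count must be non-negative")
    remaining = job_count
    total = Decimal("0")
    for size, rate in TIERS:
        if remaining <= 0:
            break
        # Bill only the jobs that fall inside this band.
        billable = remaining if size is None else min(remaining, size)
        total += billable * rate
        remaining -= billable
    return total

for n in (500, 5_000, 200_000):
    print(n, export_cost(n))
```

Under these assumptions, 5,000 jobs would cost 1,000 × $0.10 + 4,000 × $0.04 = $260.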

Self-serve in minutes.

Search and filter jobs in the app
Click Export. Pick CSV or JSON
Emailed within minutes
First 10 jobs free. No subscription required.

Historical or expired job data?

Historical and expired listings are priced based on volume, time range, and use case. Tell us what you need and we'll get you a quote within one business day.

Delivered via Amazon S3 or REST API in your choice of format. Need Parquet, Avro, or BigQuery? Just ask.

JSON · CSV · S3 · API
Get Started

Three ways to get started.

Pick the path that fits. Sample, self-serve, or custom — we'll meet you where you are.

© 2026 HireBase. All rights reserved.