Cart

Our Data Sources & Verification Process

Our Data Sources & Verification Process

At MediumAxis, we build structured, reliable datasets designed for professionals, marketers, and organizations who rely on accurate business intelligence.

This page explains how our data is collected, verified, and maintained — and the technology that powers it.

8B+
Records Processed on Request
Custom slices, segments, enrichment (to Mar 2026)
2M+
Daily Data Matching
Internal validation checks
600K+
Daily Email SMTP Checks
Deliverability validation
12TB+
Computed Database Size
Compressed & growing

A subsidiary of The Omega Project

Global Data Foundation

Our datasets are compiled from a wide range of reputable public, licensed, and commercial sources, ensuring maximum coverage and accuracy across global industries.

MediumAxis maintains one of the largest private collections of business and consumer intelligence globally: over 4 terabytes of structured data, more than 1 billion professional and executive records, 90+ million business entities, and 650+ million total records across 100+ countries. All maintained locally. All queryable without API limits or per-seat fees.

Global Business Intelligence

Bureau van Dijk Orbis

439 million records

The foundation of corporate intelligence: global company profiles, ownership structures, subsidiary networks, shareholder analysis, M&A history, and decision-maker identification. Interlinked with MediumAxis derived datasets: business owners and operators, controlling shareholders, patent portfolios, expansion projects, and corporate group hierarchies.

Dun & Bradstreet DUNS

99 million records

Universal business identifiers, credit risk indicators, firmographic depth, and global supply chain intelligence.

LinkedIn

420 million professionals

Career history, professional networks, skills, endorsements, and organizational mapping across 108 countries.

Apollo.io

150+ million records
85% coverage

Technographic intelligence, contact verification, intent signals, and professional contact data.

Adapt.io

250+ million B2B contacts

Relationship mapping, organizational charts, and professional network pathways with extensive global coverage.

Crunchbase

5 million records

Startup and private company funding, investor relationships, founder profiles, and growth signals.

Zacks Investment Research

10 million records

Public company financials, institutional ownership, analyst coverage, and earnings intelligence.

NetProspex

45 million records

B2B contact enrichment and professional verification.

Americas Business & Consumer Intelligence

United States: Population & Consumer Depth

USA 250 Million Population Behavioral Dataset

250+ columns

Comprehensive household intelligence including geographic, demographic, property, and behavioral metrics.

Acxiom / LiveRamp

240 million records, 270+ columns

Demographic, financial, property, lifestyle, and behavioral depth.

Whole National Insurance Database

241 million records
119M unique households

Comprehensive US consumer intelligence covering the entire population with detailed household demographics, property ownership, lifestyle indicators, and behavioral risk profiles. Essential for insurance underwriting, financial services marketing, and consumer segmentation with high-precision household-level targeting.

US Voter Registration

Full national coverage

Political affiliation, civic engagement, demographic validation by state.

US Loan / Mortgage Dataset

14 million records

Property values, lender relationships, refinancing behavior, credit indicators.

Social Security Death Master File

105 million records

Deceased identification, data hygiene, generational analysis.

US Legal Intelligence

12 million records

Court records, litigation history, personal legal profiles.

US Car Owners

Full national dataset

Vehicle type, value, financing, behavioral indicators.

US Physicians Database

800,000 physicians
Unique comprehensive coverage

Specialty, hospital affiliation, practice details, prescribing patterns.

Lifestyle & Affinity Databases
Gun owners registry
Meal delivery users
MGM Resorts customers
Premium fragrance buyers
Luxury auction bidders
Senior citizens database
Audi owners
Nursing databases
US Business Directories

International Business & Consumer Intelligence

Europe

80+ million records

  • UK Corporate: 10 million businesses and owners, corporate structures, beneficial ownership, import/export history
  • European Frequent Flyers: Behavioral dataset for travel and affluence targeting
  • Germany, France, Netherlands, Nordics: Business directories, professional data, Xing integration

Asia-Pacific

60+ million records

  • Japan: 840,000 business records with 80+ data points
  • Indonesia, Vietnam, Philippines: Behavioral datasets
  • Thailand: 18 million customer profiles
  • Singapore, Hong Kong, Australia, New Zealand: Business intelligence, luxury collectors

Middle East & Africa

25+ million records

  • UAE, Saudi Arabia: Business directories, professional networks
  • Kenya, Nigeria: Adtech user profiles, mobile-first intelligence
  • South Africa: Business and consumer datasets
  • Egypt, Pakistan: Middle-class behavioral datasets

Latin America & Other Markets

  • Brazil: Business firmographics and behavioral datasets
  • Israel: 5 million citizenship records
  • Turkey: Citizens database with comprehensive details
  • China: Businesses operating in USA and Europe

Specialized & Niche Databases

🏥

Healthcare Global

US physicians, nursing databases (USA and Europe), medical executives and specialty professionals

⚖️

Legal & Regulatory

US legal intelligence, global regulatory and compliance intelligence

Death & Data Hygiene

Social Security Death Master File for deceased identification and list cleaning

👥

Social & Digital

Facebook (506 million records, 108 countries): Profile details and behavioral signals

🛢️

Industry-Specific

Global Oil & Gas professionals, Financial Intelligence Buyers

Derived & Interlinked Intelligence

MediumAxis unique capability: cross-source matching and derived datasets from BvD Orbis:

Shareholder analysis and beneficial ownership mapping
Corporate expansion projects and capital deployment
Business owners vs. operators distinction
Patent portfolio intelligence
Decision-maker identification beyond titles
Corporate group structures and control relationships

Data Capabilities

Cross-Source Matching

1 billion+ professional records matched across LinkedIn, Apollo, Adapt, BvD, D&B, and regional sources for validation and enrichment.

Geographic Coverage

100+ countries with depth varying by market—comprehensive US and Europe, growing Asia-Pacific, emerging Africa and Latin America.

Update Frequency

Continuous refresh of major sources, quarterly validation of derived intelligence, real-time matching where technically feasible.

Delivery Format

Raw data exports, API-compatible files, CRM-integrated enrichment, or fully executed campaigns via MediumAxis infrastructure.

Our Data Sources

We source and standardize data from:

📂
Open and public business directories and structured datasets
💼
Premium data providers
🌐
Professional networks such as LinkedIn and Xing, alongside specialized industry registries
📋
Official company filings and government registries across multiple jurisdictions
🔒
Corporate websites and licensed private datasets, integrated into our unified schema

All raw input data undergoes internal aggregation, normalization, and standardization using proprietary schema models developed by MediumAxis to ensure consistency and interoperability across all datasets.

🔍

Verification & Quality Control

Every dataset passes through a multi-stage verification workflow inside our centralized PostgreSQL data warehouse.

1

Cross-Matching & Deduplication

Records are automatically compared against our latest verified datasets to eliminate duplicates and outdated entries. Each contact or company record is assigned a unique internal ID to maintain dataset integrity.

2

Ongoing Data Refresh

Our internal systems perform continuous updates as new verified data becomes available. Published, ready-to-use datasets are refreshed on a weekly basis, while internal data layers update automatically in real time.

3

Email Deliverability Validation

Email addresses are validated using SMTP-level checks and random verification passes, maintaining an active deliverability rate above 85% across all public datasets.

4

Multi-Field Validation

Phone numbers, company names, LinkedIn profiles, and addresses are verified against multiple sources to ensure consistency and data freshness.

5

Manual Review Passes

Internal analysts periodically perform manual quality assurance and field-level audits to catch anomalies, fill gaps, and enhance data accuracy before release.

⚙️

Data Storage & Computing Cluster

Our infrastructure runs on a dedicated distributed computing cluster optimized for high-performance data import, validation, and enrichment.

Architecture

PostgreSQL-based distributed warehouse

Locations

OVH servers (5 global data centers) + Hetzner Frankfurt for redundancy

Node Roles

Specialized workloads: ingestion, cross-referencing, SMTP checks, compression, deduplication

Backend Tools

Custom-built interfaces for manual review, matching, enrichment, and validation

Parallel processing across nodes for optimal throughput
Versioned, timestamped, archived datasets with full traceability
Redundancy mechanisms ensure business continuity

Our Commitment

At MediumAxis, we believe that data quality drives business results. By combining trusted sources, a rigorous verification pipeline, and robust infrastructure, we ensure that every dataset we publish is accurate, consistent, and ready for professional use.

Whether for lead generation, market research, or strategic analytics, our datasets deliver the precision and reliability that modern data-driven businesses depend on.

Browse Our Products

Explore our live datasets and business contact lists

Browse Products →