Our Data Sources & Verification Process
At MediumAxis, we build structured, reliable datasets designed for professionals, marketers, and organizations who rely on accurate business intelligence.
This page explains how our data is collected, verified, and maintained — and the technology that powers it.
A subsidiary of The Omega Project
Global Data Foundation
Our datasets are compiled from a wide range of reputable public, licensed, and commercial sources, ensuring maximum coverage and accuracy across global industries.
MediumAxis maintains one of the largest private collections of business and consumer intelligence globally: over 4 terabytes of structured data, more than 1 billion professional and executive records, 90+ million business entities, and 650+ million total records across 100+ countries. All maintained locally. All queryable without API limits or per-seat fees.
Global Business Intelligence
Bureau van Dijk Orbis
439 million records
The foundation of corporate intelligence: global company profiles, ownership structures, subsidiary networks, shareholder analysis, M&A history, and decision-maker identification. Interlinked with MediumAxis derived datasets: business owners and operators, controlling shareholders, patent portfolios, expansion projects, and corporate group hierarchies.
Dun & Bradstreet DUNS
99 million records
Universal business identifiers, credit risk indicators, firmographic depth, and global supply chain intelligence.
420 million professionals
Career history, professional networks, skills, endorsements, and organizational mapping across 108 countries.
Apollo.io
150+ million records
85% coverage
Technographic intelligence, contact verification, intent signals, and professional contact data.
Adapt.io
250+ million B2B contacts
Relationship mapping, organizational charts, and professional network pathways with extensive global coverage.
Crunchbase
5 million records
Startup and private company funding, investor relationships, founder profiles, and growth signals.
Zacks Investment Research
10 million records
Public company financials, institutional ownership, analyst coverage, and earnings intelligence.
NetProspex
45 million records
B2B contact enrichment and professional verification.
Americas Business & Consumer Intelligence
United States: Population & Consumer Depth
USA 250 Million Population Behavioral Dataset
250+ columns
Comprehensive household intelligence including geographic, demographic, property, and behavioral metrics.
Acxiom / LiveRamp
240 million records, 270+ columns
Demographic, financial, property, lifestyle, and behavioral depth.
Whole National Insurance Database
241 million records
119M unique households
Comprehensive US consumer intelligence covering the entire population with detailed household demographics, property ownership, lifestyle indicators, and behavioral risk profiles. Essential for insurance underwriting, financial services marketing, and consumer segmentation with high-precision household-level targeting.
US Voter Registration
Full national coverage
Political affiliation, civic engagement, demographic validation by state.
US Loan / Mortgage Dataset
14 million records
Property values, lender relationships, refinancing behavior, credit indicators.
Social Security Death Master File
105 million records
Deceased identification, data hygiene, generational analysis.
US Legal Intelligence
12 million records
Court records, litigation history, personal legal profiles.
US Car Owners
Full national dataset
Vehicle type, value, financing, behavioral indicators.
US Physicians Database
800,000 physicians
Unique comprehensive coverage
Specialty, hospital affiliation, practice details, prescribing patterns.
Lifestyle & Affinity Databases
Meal delivery users
MGM Resorts customers
Premium fragrance buyers
Luxury auction bidders
Senior citizens database
Audi owners
Nursing databases
US Business Directories
International Business & Consumer Intelligence
Europe
80+ million records
- UK Corporate: 10 million businesses and owners, corporate structures, beneficial ownership, import/export history
- European Frequent Flyers: Behavioral dataset for travel and affluence targeting
- Germany, France, Netherlands, Nordics: Business directories, professional data, Xing integration
Asia-Pacific
60+ million records
- Japan: 840,000 business records with 80+ data points
- Indonesia, Vietnam, Philippines: Behavioral datasets
- Thailand: 18 million customer profiles
- Singapore, Hong Kong, Australia, New Zealand: Business intelligence, luxury collectors
Middle East & Africa
25+ million records
- UAE, Saudi Arabia: Business directories, professional networks
- Kenya, Nigeria: Adtech user profiles, mobile-first intelligence
- South Africa: Business and consumer datasets
- Egypt, Pakistan: Middle-class behavioral datasets
Latin America & Other Markets
- Brazil: Business firmographics and behavioral datasets
- Israel: 5 million citizenship records
- Turkey: Citizens database with comprehensive details
- China: Businesses operating in USA and Europe
Specialized & Niche Databases
Healthcare Global
US physicians, nursing databases (USA and Europe), medical executives and specialty professionals
Legal & Regulatory
US legal intelligence, global regulatory and compliance intelligence
Death & Data Hygiene
Social Security Death Master File for deceased identification and list cleaning
Social & Digital
Facebook (506 million records, 108 countries): Profile details and behavioral signals
Industry-Specific
Global Oil & Gas professionals, Financial Intelligence Buyers
Derived & Interlinked Intelligence
MediumAxis unique capability: cross-source matching and derived datasets from BvD Orbis:
Data Capabilities
Cross-Source Matching
1 billion+ professional records matched across LinkedIn, Apollo, Adapt, BvD, D&B, and regional sources for validation and enrichment.
Geographic Coverage
100+ countries with depth varying by market—comprehensive US and Europe, growing Asia-Pacific, emerging Africa and Latin America.
Update Frequency
Continuous refresh of major sources, quarterly validation of derived intelligence, real-time matching where technically feasible.
Delivery Format
Raw data exports, API-compatible files, CRM-integrated enrichment, or fully executed campaigns via MediumAxis infrastructure.
Our Data Sources
We source and standardize data from:
Open and public business directories and structured datasets
Premium data providers
Professional networks such as LinkedIn and Xing, alongside specialized industry registries
Official company filings and government registries across multiple jurisdictions
Corporate websites and licensed private datasets, integrated into our unified schema
All raw input data undergoes internal aggregation, normalization, and standardization using proprietary schema models developed by MediumAxis to ensure consistency and interoperability across all datasets.
Verification & Quality Control
Every dataset passes through a multi-stage verification workflow inside our centralized PostgreSQL data warehouse.
Cross-Matching & Deduplication
Records are automatically compared against our latest verified datasets to eliminate duplicates and outdated entries. Each contact or company record is assigned a unique internal ID to maintain dataset integrity.
Ongoing Data Refresh
Our internal systems perform continuous updates as new verified data becomes available. Published, ready-to-use datasets are refreshed on a weekly basis, while internal data layers update automatically in real time.
Email Deliverability Validation
Email addresses are validated using SMTP-level checks and random verification passes, maintaining an active deliverability rate above 85% across all public datasets.
Multi-Field Validation
Phone numbers, company names, LinkedIn profiles, and addresses are verified against multiple sources to ensure consistency and data freshness.
Manual Review Passes
Internal analysts periodically perform manual quality assurance and field-level audits to catch anomalies, fill gaps, and enhance data accuracy before release.
Data Storage & Computing Cluster
Our infrastructure runs on a dedicated distributed computing cluster optimized for high-performance data import, validation, and enrichment.
Architecture
PostgreSQL-based distributed warehouse
Locations
OVH servers (5 global data centers) + Hetzner Frankfurt for redundancy
Node Roles
Specialized workloads: ingestion, cross-referencing, SMTP checks, compression, deduplication
Backend Tools
Custom-built interfaces for manual review, matching, enrichment, and validation
Our Commitment
At MediumAxis, we believe that data quality drives business results. By combining trusted sources, a rigorous verification pipeline, and robust infrastructure, we ensure that every dataset we publish is accurate, consistent, and ready for professional use.
Whether for lead generation, market research, or strategic analytics, our datasets deliver the precision and reliability that modern data-driven businesses depend on.