Firmographics (Company Core Profiles)

Overview

The Veridion Core Company Profiles dataset includes comprehensive firmographic and operational attributes for businesses globally, representing a stratified random sample of 20% of companies per country from the full Veridion universe of 134M+ firms.

This dataset contains company identifiers including exchange ticker symbols for public companies, geocoded headquarters locations, industry classifications (NAICS 2022, SIC Rev.4, NACE Rev.2, UK SIC 2007), and size metrics including employee counts and revenue, answering questions about what companies do, where they operate, how large they are, and what markets they serve.

The dataset is particularly valuable for research on private companies and SMEs that are typically underrepresented in traditional financial databases.

Data InformationValue
Historical CoverageSnapshot
Geographic CoverageUnited States

Key Concepts

What does each record represent?

  • Veridion ID (unique ID per company): This is a company-level record with headquarters location and aggregate operational characteristics.

How is this data collected?

  • Veridion's proprietary aggregation engine continuously indexes and verifies information from billions of company websites, public registries, regulatory filings, online product catalogs, social profiles, and trusted news outlets.
  • Veridion only process first-party or publicly accessible data, feeding it through advanced AI-driven models to guarantee accuracy and relevance. They maintain full end-to-end control of their data pipeline, owning every step from sourcing and extraction to normalization and delivery without reliance on external vendors.
  • User feedback loops and continuous machine-learning refinement cycles further enhances the precision and dynamism of the datasets.
  • Data is derived from AI-powered analysis of digital footprints and registry sources to provide accurate, up-to-date intelligence on both public and private companies. Also included are rich operational characteristics such as business models, technology focus, supply chain positioning, certifications, and social media presence.
  • The dataset is anchored to companies with active web domains, which may result in underrepresentation of very small, informal, or digitally inactive businesses.