Namsor

US race ethnicity - Find the ethno-racial group from a first, last or full name

Namsor harnesses advanced morphological and onomastic techniques to provide highly accurate U.S. race and ethnicity estimates in line with U.S. Census categories.

Our AI can determine the 'race' ethnicity from any last name, first name, or full name, offering unmatched precision thanks to its training on billions of names.

Estimate the U.S. race ethnicity with our advanced tool

Identify from a first name and/or a surname and a country of residence, the most likely United State's race classification based on the US Census taxonomy.

Slightly more accurate with separate names.
ZIP code improves accuracy.

U.S. race ethnicity: first & last name

Ideal feature for estimating the U.S. race ethnicity from a split name:
Select a taxonomy :
For international names (non US resident) please choose 4 classes. Enumerators for the returned 'race' ethnicity. US race enumerators.

First name, given name, nickname.

Last name, family name, surname.

Country of residence, in ISO 3166-1 alpha-2 format.

information

Understanding the returned values

Use our U.S. race ethnicity finder through an API or our simple interface to estimate an individual's U.S. race ethnicity based on their name. Here’s how it works:

  • U.S. race ethnicity indicator

    U.S. race ethnicity Identifies the likely U.S. race ethnicity associated with the name.

  • Confidence level indicator

    Calibrated probability (Between 0% and 100%) Shows how confident the system is in its prediction. For instance, a 90% score indicates a very high certainty of the estimated U.S. race ethnicity

  • Alternative U.S. race ethnicity indicator

    Alternative U.S. race ethnicity Shows the second most U.S. race ethnicity associated with the name.

  • Alternative probability indicator

    Alt. Calibrated probability (Between 0% and 100%) Measures the overall likelihood that the name corresponds to either the primary or alternative estimation. It runs higher than the standard probability because it covers multiple possibilities.

  • Writing system indicator

    Script (Latin, Cyrillic, etc.) Indicates the writing system used, providing clues about linguistic and cultural background.

How Namsor Estimates U.S. Race Ethnicity?

Namsor offers a dedicated tool to identify an individual's likely U.S. race ethnicity based on their first name, last name, or full name.

Example of a basic morphological analysis of the Sharma surname.

This process uses an advanced onomastic model and large-scale linguistic data to infer which of the six standard U.S. Census categories best matches the provided name. Specifically, our approach aligns with a social definition of race, reflecting the way different communities and cultural groups are recognized in official U.S. statistics.

According to the U.S. Census Bureau, and under the guidelines of the Office of Management and Budget (OMB) and the Royal Society of Chemistry, our taxonomy comprises six main categories:

  • American Indian or Alaskan Native
  • Asian
  • Black
  • Hispano latino
  • Pacific Islander
  • White

These classifications help capture the rich diversity of the U.S. population by grouping individuals through sociocultural factors. When you enter a name, Namsor's system analyzes linguistic and historical naming patterns, then predicts the most likely match within this taxonomy.

By relying on extensive global data, our solution continuously improves its accuracy, offering clear insights for researchers, businesses, and anyone seeking to understand U.S. race ethnicity in a responsible and data-driven manner.

How Our U.S. Race Ethnicity AI Works

At Namsor, we've developed an advanced AI system designed to determine a person's U.S. race ethnicity based on their name. This technology combines extensive datasets, cutting-edge linguistic analysis, and ongoing refinement to produce accurate, reliable insights. Below is an overview of our four-step methodology:

  1. Data collection icon
    1

    Comprehensive data aggregation and preparation

  2. AI model training icon
    2

    Onomastic model construction and training

  3. Model validation icon
    3

    Model evaluation and benchmarking

  4. Continuous learning icon
    4

    Continuous improvement and adaptation

Additional origin taxonomies

  • The earth with a location sign over South America.

    Origin

    Origin is a taxonomy that categorizes a person's origin based on their own, their parents', or their ancestors' country of origin.

    Find name origin
  • A group of people of different ethnicities in front of a map of the earth.

    Ethnicity

    The diaspora categorizes people by shared cultural, national, or linguistic backgrounds rather than geography.

    Guess name ethnicity
  • A group of residential buildings with a location symbol in front.

    Residence country

    A person's residence country is where they have lived most in the past year, often a better indicator than nationality.

    Identify location

Solutions for using our onomastic tool

Determine the most likely U.S. race ethnicity behind a name using our specialized onomastic solution. Here are three ways to analyze names against the six official U.S. census categories:

A group of people from different backgrounds processing an Excel file using software.

CSV and Excel Tool

Upload your CSV or Excel file, select the relevant settings, and analyze which of the six official U.S. race categories best matches each first name, last name, or full name.

This tool provides data-backed insights into demographic classification across your entire dataset.

Process a CSV or Excel file
Two people interacting with computer servers.

API Documentation

For larger or frequent analysis tasks, integrate our API into your existing systems for automated U.S. race ethnicity classification.

Comprehensive documentation and sample code in Python, JavaScript, Java, and Shell facilitate development and deployment.

Explore the API Documentation
Groups of invdividuals building software using different modules.

Developer Tools

Access advanced U.S. race ethnicity analysis using our SDKs and CLI options for Python, Java, GoLang, and JavaScript.

These tools offer advanced onomastic and linguistic functions to handle projects of all sizes with precise demographic insights.

Download Developer Tools

Why estimate U.S. race ethnicity?

Estimating U.S. race ethnicity from first names, surnames, and full names provides valuable insights across various sectors.

Scientific microscope next to clipboard symbolizing academic name data analysis

Research

A leading public health institute uses Namsor's U.S. Race Ethnicity estimation to study disease trends in different communities.

By analyzing participant names, they accurately identify likely demographic groups and tailor interventions with pinpoint precision.

Silhouettes standing side by side representing equality in hiring processes

Fighting discrimination

A large municipal HR department integrates Namsor's technology to uncover hidden biases in hiring.

Analyzing candidate names helps highlight possible underrepresented groups, so they can refine recruitment strategies and promote a more inclusive workplace.

World map with connected silhouettes representing diaspora community tracking

Diaspora mapping

An international NGO relies on Namsor's solution to locate diaspora communities and provide targeted resources.

By examining names across regions, they discover where specific cultural groups are concentrated and focus their outreach where it has the greatest impact.