Back to packs

DATA Act Federal Reporting

DATA_ACTfree

Validate federal spending data — UEI, CFDA, FIPS codes, congressional districts, and NIBRS crime reporting identifiers.

10 rules 2901 downloads4.1 avg (210)
data-actfederalgovernmentueifipsnibrscensus
4.1(210 ratings)

Sign in to rate this pack

Test this pack with your data

Download the template, fill in your data, and see quality results instantly.

Test This Pack

Download & Install

Choose your tool — get a ready-to-run file

Run this on your data? Upload your CSV — we'll auto-map the columns, validate, and report the bad rows.Test my data
Or use the CLI
$ npx dqhub install data-act-federal --format soda --table YOUR_TABLE

About this pack

Data quality rules for U.S. federal government reporting compliance. Covers: - DATA Act (DAIMS): UEI entity identifiers, CFDA program numbers - Census: FIPS state codes, congressional districts - FBI NIBRS: Originating Agency Identifiers (ORI) - General completeness and format validation Based on publicly available federal standards from SAM.gov, Census Bureau, and FBI.

Sources & References

All entities registered in SAM.gov receive a 12-character alphanumeric UEI that must be used in all federal award reporting

OMB — 2 CFR 25.110 (Central Contractor Registration)

Federal award recipients must have a valid UEI for all grant and contract transactions

Federal spending data must include valid CFDA program numbers for all federal assistance awards

Census Bureau — FIPS 5-2 / ANSI INCITS 38:2009 (State Codes)

Federal data systems must use standardized 2-digit FIPS codes to identify states and equivalent entities

All law enforcement agencies participating in NIBRS must use their assigned 9-character ORI code for incident reporting

What's included

6format rules
2completeness rules
1uniqueness rules
1consistency rules

Checks included (10)

Unique Entity Identifier (UEI) Format(uei)

Validates that Unique Entity Identifier (UEI) values are exactly 12 uppercase alphanumeric characters. The UEI replaced the DUNS number in April 2022 as the primary identifier for entities doing business with the federal government. UEIs are assigned through SAM.gov registration.

CFDA Program Number Format(cfda_number)

Validates that CFDA (Catalog of Federal Domestic Assistance) program numbers conform to the standard XX.XXX format: a 2-digit agency code, a period, and a 3-digit program code. CFDA numbers uniquely identify federal assistance programs and are required in DATA Act reporting for grants and cooperative agreements.

FIPS State Code Format(state_fips)

Validates that FIPS (Federal Information Processing Standards) state codes are 2-digit values in the range 01 through 56. FIPS state codes are used by the Census Bureau and throughout federal data systems to identify U.S. states, the District of Columbia, and outlying areas.

Congressional District Code Format(congressional_district)

Validates that congressional district codes are 2-digit values in the range 00 through 53. Code 00 represents at-large districts (states with a single representative), while codes 01-53 represent numbered districts. These codes are required in federal award reporting under the DATA Act.

NIBRS Originating Agency Identifier (ORI) Format(ori_code)

Validates that NIBRS ORI (Originating Agency Identifier) codes conform to the 9-character format: 2 uppercase letters (state code) followed by 5 alphanumeric characters (agency identifier) and 2 alphanumeric characters (sub-agency or suffix). ORIs uniquely identify law enforcement agencies reporting to the FBI's National Incident-Based Reporting System.

Valid Date String Format(event_date)

Validates that date string values match the expected format. Supports configurable formats including YYYY-MM-DD (ISO 8601), MM/DD/YYYY, DD/MM/YYYY, YYYY/MM/DD, and DD-Mon-YYYY. Validates month (01-12), day (01-31), and reasonable year ranges.

Column Not Null

Asserts that a specified column contains no null values. This is the most fundamental completeness check — every row must have a value present in the target column.

Column Completeness Threshold

Asserts that a column meets a minimum completeness threshold, measured as the percentage of non-null values. Useful when some nulls are acceptable but the overall population rate must stay above a defined level (e.g., 95%).

Column Unique

Validates that all non-null values in a specified column are unique. Useful for natural keys, email addresses, identifiers, and any column where duplicates indicate a data quality issue.

Enum Value Valid

Asserts that all values in a column belong to a predefined set of allowed values. Catches typos, unexpected category values, or upstream system changes that introduce new enum variants without coordination.