Back to rules

VIN Country of Manufacture Code (Position 1)

formatmedium

Validates that position 1 of the VIN contains a recognized World Manufacturer Identifier (WMI) country code per ISO 3780. Position 1 indicates the country or region of manufacture: 1-5 = North America, 6-7 = Oceania, 8-9 = South America, A-H = Africa, J-R = Asia, S-Z = Europe.

v1.0.0by dqhub1,185 downloads4.1 (65)
vincountry-codewmivehicleautomotivenhtsaisoorigin
Try This Rule

Parameters

column_namestringrequired

The column containing email addresses

thresholdfloatdefault: 0.99

Minimum fraction of valid emails (0.0 to 1.0)

Compliance Mapping

ISOISO 3780:2009 - Road vehicles - World manufacturer identifier (WMI) code

NHTSA49 CFR Part 565.15(a) - World manufacturer identifier

Install

soda
checks for {{table_name}}:
  - invalid_percent({{column_name}}) < {{(1 - threshold) * 100}}:
      valid regex: '^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}$'
dbt
{% test valid_email(model, column_name) %}
select {{ column_name }}
from {{ model }}
where {{ column_name }} not regexp '^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\.[a-zA-Z]{2,}$'
{% endtest %}
sql
SELECT COUNT(*) as total,
  SUM(CASE WHEN {{column_name}} REGEXP
    '^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\.[a-zA-Z]{2,}$'
    THEN 1 ELSE 0 END) as valid
FROM {{table_name}}
Great Expectations
{
  "expectation_type": "expect_column_values_to_match_regex",
  "kwargs": {
    "column": "{{column_name}}",
    "regex": "^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\.[a-zA-Z]{2,}$",
    "mostly": {{threshold}}
  }
}
spark
from pyspark.sql.functions import col
pattern = r'^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}$'
invalid = df.filter(~col("{{column_name}}").rlike(pattern)).count()

Test Data

Passing Examples

idvalue
1alice@example.com
2bob.smith@company.co.uk
3charlie+tag@domain.org

Failing Examples

idvalue
1not-an-email
2@missing-local.com
3spaces in@email.com

CLI

Terminal
npx dqhub install vin-country-code --format soda --table YOUR_TABLE
npx dqhub install vin-country-code --format dbt --model YOUR_MODEL
npx dqhub install vin-country-code --format sql --dialect snowflake