March 5, 2026·6 min read

How Our Safety Scoring Works

A deep dive into our 5-step scoring pipeline, from EU regulatory data to peer-reviewed research.

From data to decisions

Every ingredient in our database goes through a rigorous 5-step scoring pipeline. The goal: translate complex scientific data into a simple, trustworthy safety score that any parent can understand.

Here's exactly how it works.

Step 1: EU CosIng Import

We start with the EU Cosmetic Ingredient Database (CosIng) — the official registry maintained by the European Commission. This gives us the definitive list of ~30,000 cosmetic ingredients with their regulatory status.

Europe has the strictest cosmetic regulations in the world. The EU has banned over 1,300 ingredients, compared to just 11 in the US. Starting with EU data means we're building on the most protective regulatory framework.

Step 2: ECHA Hazard Classification

Next, we cross-reference each ingredient against the European Chemicals Agency (ECHA) database. ECHA classifies chemicals using standardised hazard statements:

H360 (May damage fertility or the unborn child) → Automatically flagged as High Risk
H361 (Suspected of damaging fertility or the unborn child) → Flagged as Medium Risk
H362 (May cause harm to breastfed children) → Flagged as Medium Risk

These regulatory flags are treated as hard constraints — our scoring algorithm cannot downgrade an ingredient that has an official hazard classification.

Step 3: Scientific Literature Review

For each ingredient, we search PubMed — the world's largest biomedical research database — for pregnancy-relevant studies. We look for:

Reproductive toxicity studies
Teratogenicity (birth defect) research
Prenatal exposure studies
Breastfeeding transfer studies

The number and quality of relevant studies affects the confidence level of our score.

Step 4: Three-Axis Scoring

Every ingredient receives three sub-scores (0-3 scale):

Hazard Score (H)

How intrinsically dangerous is this substance?

0 = No known hazards
1 = Minimal concern
2 = Moderate concern (some evidence of harm at high doses)
3 = Known hazard (regulatory flags, strong evidence)

Mechanism Score (M)

Does it have a biological pathway to cause harm during pregnancy?

0 = No known mechanism
1 = Theoretical concern only
2 = Demonstrated mechanism in animal models
3 = Demonstrated mechanism in humans

Exposure Score (E)

How much actually reaches the body from normal cosmetic use?

0 = Negligible absorption
1 = Low absorption (most cosmetic use)
2 = Moderate absorption (leave-on products)
3 = High absorption (>20% systemic availability)

Step 5: Final Score Calculation

The three sub-scores combine into a final Safety Score (0-100) using our severity-blending algorithm:

1. Band assignment first: The highest sub-score determines the risk band

2. Severity blending: Final score = 0.7 × peak severity + 0.3 × average severity

3. Hard constraints: ECHA H360/H361 flags override any calculated score

Risk Bands

Band	Score Range	What it means
No Known Risks	90-100	Safe based on all available evidence
Low Risk	70-89	Minor theoretical concerns, generally safe
Medium Risk	40-69	Some evidence of concern, consider alternatives
High Risk	0-39	Strong evidence of risk, avoid during pregnancy

Why band-first logic?

A single high hazard score should place an ingredient in a dangerous band, even if the other scores are low. For example, an ingredient with a hazard score of 3 (known teratogen) but low exposure (0) is still classified as high risk — because if exposure conditions change (different product type, damaged skin), the hazard remains.

This is the precautionary principle in action.

Product-level scoring

When you scan or analyze a full product, we:

1. Score each ingredient individually

2. Weight by position (ingredients listed first are present in higher concentrations)

3. Flag the worst offenders prominently

4. Give an overall product safety score

The product score is only as good as its worst ingredient — a single high-risk ingredient can significantly lower the overall score.

Transparency and limitations

We believe in showing our work:

Every ingredient page shows the source data and sub-scores
Our methodology page explains the full pipeline
We link to PubMed studies where available

Important limitations:

Concentration matters — an ingredient at 0.01% behaves very differently than at 5%
Individual health conditions may change risk profiles
New research can change risk assessments
This tool does not replace medical advice

Our methodology is continuously refined as new research emerges. We update scores when significant new evidence is published.