Back to articles

Why a Variant May Not Show Up in ClinVar: A Troubleshooting Checklist

Got no exact ClinVar match for a VCF position, rsID, or HGVS-style name? Use this non-diagnostic checklist before assuming the database or file is wrong.

You pasted a VCF position or rsID into ClinVar and got ClinVar no results, a nearby-looking hit, or a record whose allele does not match your file. That can feel like a variant not found in ClinVar, but the first job is not interpretation. It is matching. A lookup can fail because the searched string is incomplete, the genome build is different, the REF or ALT allele was ignored, or the variant is written in another representation.

Work through the checks below before treating the result as a database problem or as meaningful information about you.

Start by naming the mismatch you actually have

Separate the problem into one of four branches: true no result, wrong-build-looking result, nearby-looking result, or same-rsID-but-different-allele result. Each branch is a variant lookup mismatch to verify, not a conclusion.

ClinVar is a public archive of reports about relationships among human variations and health-related assertions submitted to the database. Its help documentation explains record content, review status, and submitted assertions. That makes ClinVar useful context, but not a promise that every line in a DNA or VCF file has an exact page.

First check: what exact identifier are you searching?

Write down the exact search string before changing it. A CHROM:POS value, such as chr7:200000, is only a coordinate. A VCF row carries more context: chromosome, POS, ID, REF, ALT, and other fields. A ClinVar lookup by rsID uses an identifier that may group or point to records you still need to inspect. An HGVS-style description, such as a c. expression, is a structured name that may depend on a specific sequence or transcript.

If your source file has a full row, do not reduce it to position alone. Keep at least CHROM, POS, REF, ALT, rsID if present, and the source file name or export date in your notes.

Build and allele checks before comparing records

A genome build mismatch is a common reason a coordinate appears absent, shifted, or unexpected. Human reference assemblies provide the coordinate framework for genomic positions, so chr7:200000 is not complete unless you know which assembly was used. If the file does not state the assembly, mark that uncertainty instead of guessing.

Next compare alleles. VCF records use POS, REF, and ALT to describe the observation. Same chromosome and same position are not enough. If your file says REF C and ALT T, but a ClinVar record at the same position describes REF C and ALT G, that is not an exact match for lookup purposes. Treat the allele comparison as a required checkpoint, not a detail to clean up later.

Fictional examples: same coordinate, different allele

These examples are fictional and only show matching logic:

SourceCoordinateREFALTMatching note
File row Achr7:200000CTSearch as C>T at this position and build
File row Bchr7:200000CGSame coordinate, different ALT; keep separate
ClinVar-looking hitchr7:200000CGDoes not exactly match File row A

The mistake is to accept the coordinate or rsID and stop. The safer habit is to ask: does the allele in the record match the allele in the file?

Why insertions, deletions, and representation differences can confuse lookups

Some variants are visually harder to compare than single-letter substitutions. Insertions, deletions, and repeated sequence can be represented in more than one way depending on position, reference sequence, and normalization. A nearby ClinVar record may look tempting because it sits close to your file row, but nearby is not exact.

When a match looks close, compare the file’s POS, REF, and ALT with the ClinVar record details. If one source gives a compact description and another gives a longer allele string, do not assume they are equivalent from appearance alone. Document the two strings and what did not line up.

Check transcript or sequence context for HGVS-style names

HGVS nomenclature provides structured variant descriptions, but those descriptions can depend on sequence or transcript context. A c. expression and a genomic coordinate can both be relevant descriptions, yet they are not interchangeable search strings without context.

If you searched a transcript-style name and then a raw coordinate, record which transcript, sequence, or genome assembly each search used. If that context is missing, keep the comparison unresolved.

No ClinVar result is still limited information

After identifier, build, allele, representation, and transcript checks, you may still have no exact ClinVar match. That result should stay narrow: it is limited lookup information. It does not say the variant is favorable, unfavorable, benign, pathogenic, protective, harmful, or predictive.

ClinVar contains submitted reports and assertions; absence of an exact record is not the same as evidence about a person’s health. The practical output is a better note, not a diagnosis.

No ClinVar match: what should I check next?

Lookup patternCheck nextWhy it mattersStop note
CHROM:POS gives no resultFind the full VCF rowPOS alone omits REF and ALTRecord the exact search string
Unexpected result at positionVerify source and result assembliesCoordinates depend on reference assemblyDo not switch builds without noting it
Nearby ClinVar recordCompare chromosome, position, build, REF, ALTNearby is not exactLabel it nearby unless all fields align
Same position, different ALTCompare allele detailsC>T is not C>GKeep observations separate
Same rsID, allele differsDo not accept rsID aloneIdentifier-only matching can be incompleteNote rsID match as unresolved
HGVS name does not match coordinateIdentify transcript or sequence contextHGVS names can be context-dependentDo not treat formats as interchangeable
Still no exact matchConsider ClinVar scopePublic archives are not exhaustive file checkersRecord what remains uncertain

Common mistakes

  • Searching only the rsID and ignoring REF and ALT.
  • Treating a nearby record as the same variant.
  • Mixing genome builds without documenting the change.
  • Reading a no-result screen as good or bad news.
  • Comparing an HGVS c. expression to a raw coordinate without transcript context.
  • Trying one search string, then losing the original file row.

When to stop searching and document the uncertainty

Stop when you have completed the ordered checks and still cannot make an exact match. Your note should include: the searched string, source file row, assembly if known, REF and ALT, rsID if present, ClinVar records reviewed, and the reason each was or was not an exact match.

A useful stop note might be: Searched chr7:200000, assembly unknown, REF C, ALT T; reviewed nearby chr7:200000 C>G record; allele differs; exact ClinVar match not established. This keeps the uncertainty visible instead of turning it into an unsupported interpretation.

How BioDecode can support organized local review without overstating the result

BioDecode can help when your goal is organized, local, privacy-first review of identifiers and file rows. That does not mean every ClinVar mismatch can be resolved, and it does not turn a failed lookup into health guidance. For a structured educational walkthrough of the workflow, see the BioDecode guide.

Next step

See how BioDecode keeps genome analysis on your own machine.

Explore BioDecode

This article is educational and is not medical advice.