How to Detect Amino Acid Changes from a GATK VCF File and Reference Genome Using Code?
0
0
Entering edit mode
12 weeks ago
DareDevil ★ 4.3k

Hello,

I have a VCF file generated from GATK and the reference genome used for the analysis. I'm interested in detecting amino acid changes resulting from the variants listed in the VCF file without going for annotation like VEP or other tools. I understand that this involves interpreting the variants within coding regions and translating the resulting nucleotide changes into amino acids.

Specifically, I would like to:

  1. Parse the VCF file to extract variant information.
  2. Use the reference genome to determine the sequence context around each variant.
  3. Identify the codon changes and translate them to detect amino acid changes.
tabix codons vcf variants gatk • 307 views
ADD COMMENT
1
Entering edit mode

Use the reference genome to determine the sequence context around each variant. Identify the codon changes and translate them to detect amino acid changes

that is basically what VEP and SNPEFF, bcftools csq, etc... do ...

ADD REPLY
0
Entering edit mode

You would need also a GFF/GTF file with gene models annotated in that reference genome. You could then bedtools intersect your genomic variants with CDS features and work out the codon changes considering the phase.

ADD REPLY

Login before adding your answer.

Traffic: 1453 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6