Cool Bioinformatics Scripts

Table of Contents

qqplot

downsampling bam files

genotype refinement using beagle 3

calculate genotype discordance

R2 vs MAF plot

Phylogenetic Tree

fixref

qqplot

You can make a QQ plot in the following ways.

one-liner for reading many millions of P values from a pipe

# python 
zcat pval.txt.gz | qqplot.py -out test -title "QQ plot on the fly"
# julia (running it in the REPL is recommended)
zcat pval.txt.gz | qqplot.jl --out test --title "QQ plot on the fly"

warning : If you have on the order of 100 billion P values to process, use qqplot.jl rather than qqplot.py. On my server the julia version processes about 5 billion lines per hour, while the python version manages only about 700 million, so 100 billion values take roughly 20 hours versus about 143 hours.

running in a julia REPL (recommended)

include("qqplot.jl")
cmd = pipeline(`zcat pval.gz`, `awk 'NR>1{print $10}'`)
sigp, expp = qqfly("test", cmd=cmd)

use qqplot.py in your script

import numpy as np
from qqplot import qq
p = np.random.random(1000000)
qq(x=p, figname="test.png")
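
For readers who want to see what a QQ plot compares, here is a minimal sketch of the underlying calculation. It is not the code in qqplot.py; the function name qq_points and the (i - 0.5) / n plotting position are assumptions made for illustration. It pairs each sorted observed P value with the quantile expected under a uniform null, both on the -log10 scale.

```python
import math

def qq_points(pvals):
    """Return (expected, observed) -log10(P) lists for a QQ plot.

    Hypothetical helper for illustration; not part of qqplot.py.
    """
    # Keep only valid P values in (0, 1]; a zero would give an infinite -log10.
    p = sorted(v for v in pvals if 0.0 < v <= 1.0)
    n = len(p)
    # Assumed plotting position: the i-th smallest of n uniform P values
    # is expected near (i - 0.5) / n.
    expected = [-math.log10((i - 0.5) / n) for i in range(1, n + 1)]
    observed = [-math.log10(v) for v in p]
    return expected, observed

exp_p, obs_p = qq_points([0.5, 0.01, 0.2, 0.9])
```

Plotting obs_p against exp_p gives the QQ plot; points lying above the diagonal indicate P values smaller than expected under the null.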

downsampling bam files

Usage: downsample.sh [-b <bamlist>] [-d <depth>] [-n <cores>] [-o <outdir>]

genotype refinement using beagle 3

Usage: beagle3-imputation.sh [options]
Pipeline for genotype refinement of medium-depth sequencing data using beagle3

-h,          Display help
-i,          Input VCF/BCF file
-o,          Output folder
-f,          MAF filter applied before imputation

calculate genotype discordance

When you run an imputation analysis with BEAGLE (or other imputation tools), you may want to know the distribution of genotype discordance between the original VCF and the imputed VCF.

usage: calc_imputed_gt_discord.py [-h] [-chr STRING] VCF1 VCF2 OUT

warning : Before running the script, make sure the two VCFs contain exactly the same sites and samples for each chromosome.
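
To make the idea of genotype discordance concrete, here is a minimal sketch for a single site. It is not the logic of calc_imputed_gt_discord.py; the helper names gt_dosage and discordance are hypothetical, and the sketch assumes biallelic diploid GT strings, treats phased and unphased calls alike, and skips missing genotypes.

```python
def gt_dosage(gt):
    """Count non-reference alleles in a GT string such as '0/1' or '1|0'.

    Returns None when any allele is missing ('.').
    """
    alleles = gt.replace("|", "/").split("/")
    if "." in alleles:
        return None
    return sum(a != "0" for a in alleles)

def discordance(gts1, gts2):
    """Fraction of samples whose genotype differs between two calls at one site.

    Assumes both lists hold the same samples in the same order.
    """
    compared = mismatched = 0
    for a, b in zip(gts1, gts2):
        da, db = gt_dosage(a), gt_dosage(b)
        if da is None or db is None:
            continue  # skip samples with a missing genotype in either file
        compared += 1
        mismatched += da != db
    return mismatched / compared if compared else float("nan")

original = ["0/0", "0/1", "1/1", "./."]
imputed = ["0|0", "1|0", "0|1", "0|0"]
rate = discordance(original, imputed)
```

Here three samples are comparable and one of them changes from homozygous ALT to heterozygous, so rate is one third.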

R2 vs MAF plot

Plot INFO/R2 against minor allele frequency (MAF) after imputation with BEAGLE or similar tools.
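
One common way to summarise R2 against MAF is to average R2 within MAF bins. The sketch below illustrates that idea only; it is not taken from plot-r2-vs-maf.R, and the function name mean_r2_by_maf, the bin edges, and the (allele_freq, r2) input format are all assumptions.

```python
def mean_r2_by_maf(sites, edges=(0.0, 0.01, 0.05, 0.1, 0.2, 0.5)):
    """Average R2 per MAF bin.

    sites: iterable of (allele_freq, r2) pairs (assumed input format).
    Returns a list of (low, high, mean_r2), with mean_r2 None for empty bins.
    """
    nbins = len(edges) - 1
    sums = [0.0] * nbins
    counts = [0] * nbins
    for af, r2 in sites:
        maf = min(af, 1.0 - af)  # fold the allele frequency into [0, 0.5]
        for i in range(nbins):
            # Bins are (low, high]; the first bin also includes its lower edge.
            if edges[i] < maf <= edges[i + 1] or (i == 0 and maf == edges[0]):
                sums[i] += r2
                counts[i] += 1
                break
    return [
        (edges[i], edges[i + 1], sums[i] / counts[i] if counts[i] else None)
        for i in range(nbins)
    ]

bins = mean_r2_by_maf([(0.005, 0.4), (0.03, 0.7), (0.97, 0.9), (0.3, 0.95)])
```

Each returned tuple can be drawn as one point or bar, with the bin midpoint on the x axis and the mean R2 on the y axis.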

Phylogenetic Tree

fixref

Before running bcftools merge, you may need to fix the REF and ALT alleles and the corresponding genotypes so that they match the reference VCF; otherwise bcftools will surprise you.

usage: fixref.py [-h] REF_VCF IN_VCF OUT_VCF
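
The kind of fix described above can be sketched for a single biallelic record as follows. This is an illustration under stated assumptions, not the implementation in fixref.py: the function name fix_record is hypothetical, the genotype strings are assumed to contain only the GT field, and strand flips are not handled.

```python
def fix_record(panel_alleles, ref, alt, gts):
    """Align one biallelic record to the (REF, ALT) pair of a reference VCF.

    Returns (ref, alt, gts), or None when the alleles cannot be reconciled
    by a simple REF/ALT swap.
    """
    panel_ref, panel_alt = panel_alleles
    if (ref, alt) == (panel_ref, panel_alt):
        return ref, alt, list(gts)  # already consistent
    if (ref, alt) == (panel_alt, panel_ref):
        # REF and ALT are swapped: recode allele 0 as 1 and allele 1 as 0,
        # leaving separators ('/', '|') and missing calls ('.') untouched.
        swap = {"0": "1", "1": "0"}
        flipped = ["".join(swap.get(ch, ch) for ch in gt) for gt in gts]
        return panel_ref, panel_alt, flipped
    return None  # different alleles; needs manual inspection

fixed = fix_record(("A", "G"), "G", "A", ["0/0", "0|1", "./."])
```

After the swap the record reports REF=A and ALT=G, and a sample that was homozygous for the old REF (G) becomes homozygous ALT.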

