Blood data inquiry¶
usage: blood_data_inquiry.py [-h] [-j JID] -f BED_FILE [-o OUTPUT]
[--bam_file_list BAM_FILE_LIST]
[--bw_list BW_LIST] (--on_bw | --on_bam)
optional arguments:
-h, --help show this help message and exit
-j JID, --jid JID enter a job ID, which is used to make a new directory.
Every output will be moved into this folder. (default:
blood_data_inquiry_yli11_2019-10-03)
-f BED_FILE, --bed_file BED_FILE
input bed file for featureCount (default: None)
-o OUTPUT, --output OUTPUT
output file name (default: featureCount)
--bam_file_list BAM_FILE_LIST
HemTools blood collection (default:
/home/yli11/HemTools/config/blood_ATAC.data)
--bw_list BW_LIST HemTools blood collection (default:
/home/yli11/HemTools/config/blood_ATAC.bw.list)
--on_bw extract values based on bw files (default: False)
--on_bam extract values based on bam files (default: False)
Summary¶
This is a suite of tools for users to query specific regions or genes among public blood datasets.
Output read counts over given bed. featureCount.tsv. Each row is one region from the input bed file. Each column is the bw/bam file name.
Input¶
Input format varies by different usage.
Bed format¶
Additional columns are OK. The first 3 columns have to be chr, start, end.
chr11 4167364 4167385
chr11 4167366 4167387
chr11 4167367 4167388
chr11 4167370 4167391
Usage¶
Signal values (bw) over input bed¶
blood_data_inquiry.py -f input.bed --on_bw
You can also input your own list of bw files by --bw_list option.
Read counts (bam) over input bed¶
blood_data_inquiry.py -f input.bed --on_bam
You can also input your own list of bam files by --bam_file_list option.
This output format is from featureCounts, the first 5 columns are: id, chr, start, end, strand