当前位置：首页 > article >正文

MINES

article 2024/11/22 4:05:28

MINES

(m)6A (I)dentification Using (N)anopor(E) (S)equencing

Tombo(v1.4) 命令在 MINES 之前执行：

（仅在 fast5 文件中尚未包含 fastq 时需要）

tombo preprocess annotate_raw_with_fastqs --fast5-basedir /fast5_dir/ --fastq-filenames Fastqs --overwrite --processes #

tombo resquiggle /fast5_dir/ REF.fa --overwrite --processes #

tombo detect_modifications de_novo --fast5-basedirs /fast5_dir/ --statistics-file-basename Stats_Filename

tombo text_output browser_files --fast5-basedirs /fast5_dir/ --statistics-filename Stats_Filename.tombo.stats --browser-file-basename output_filename --file-types coverage fraction

MINES 依赖项：

python                    3.7.0  
pandas                    0.23.4  
pybedtools                0.8.0  
numpy*                     1.16.4  
scikit-learn              0.19.2  
bedops                     2.4.35

MINES 使用方法

将 Wig 文件转换为 Bed 文件。

wig2bed < output_filename.fraction_modified_reads.plus.wig > output_filename.fraction_modified_reads.plus.wig.bed

cDNA_MINES 示例：

python cDNA_MINES.py --fraction_modified output_filename.fraction_modified_reads.plus.wig.bed --coverage output_filename.coverage.plus.bedgraph --output m6A_output_filename.bed --ref REF.fa

genomic_MINES 示例：

python genomic_MINES.py --fraction_modified_plus output_filename.fraction_modified_reads.plus.wig.bed --coverage_plus output_filename.coverage.plus.bedgraph --coverage_minus output_filename.coverage.minus.bedgraph --fraction_modified_minus output_filename.fraction_modified_reads.minus.wig.bed --output m6A_output_filename.bed --ref REF.fa

输出文件格式（bed/tab 分隔）：

chr,   start,   stop,    5-mer,   unique key,    strand,    fraction modified^,    coverage

^fraction modified 是识别到的 m6A 位点的值。然而，该位置的值应谨慎使用，因为“A”位点被发现是甲基化的不良预测因子。

参考文件可以从以下链接下载：

hg19: http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/ File: hg19.fa.gz 然后运行 gunzip。  
cDNA: ftp://ftp.ensembl.org/pub/release-91/fasta/homo_sapiens/cdna/  File: Homo_sapiens.GRCh38.cdna.all.fa.gz 然后运行 gunzip。

查看全文

http://www.kler.cn/a/403814.html