
Hi-C pairs file format

Date: 2023-04-06 23:56:54
Tags: pairs, chr1, sequence, chromsize, Hi, file format


## pairs format v1.0
#sorted: chr1-chr2-pos1-pos2
#shape: upper triangle
#chromsize: chr1 248956422
#chromsize: chr2 242193529
#chromsize: chr3 198295559
#chromsize: chr4 190214555
#chromsize: chr5 181538259
#chromsize: chr6 170805979
#chromsize: chr7 159345973
#chromsize: chr8 145138636
#chromsize: chr9 138394717
#chromsize: chr10 133797422
#chromsize: chr11 135086622
#chromsize: chr12 133275309
#chromsize: chr13 114364328
#chromsize: chr14 107043718
#chromsize: chr15 101991189
#chromsize: chr16 90338345
#chromsize: chr17 83257441
#chromsize: chr18 80373285
#chromsize: chr19 58617616
#chromsize: chr20 64444167
#chromsize: chr21 46709983
#chromsize: chr22 50818468
#chromsize: chrX 156040895
#chromsize: chrY 57227415
#chromsize: chrM 16569
#columns: readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2
.    chr1    1    chr1    51659    -    -    1    98
.    chr1    1    chr1    73925    -    -    0    152
.    chr1    1    chr1    184432    -    -    1    437
.    chr1    1    chr1    443977    -    -    1    848
.    chr1    1    chr1    509430    -    +    1    992
.    chr1    1    chr1    631351    -    +    1    1194
.    chr1    1    chr1    632024    -    +    1    1195
.    chr1    1    chr1    632032    -    +    1    1195
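The layout above (a `##` format line, `#key: value` header lines, then one record per line) can be read with a few lines of Python. This is a minimal sketch, not a reference parser: the function name `parse_pairs` is hypothetical, and it assumes that fields are separated by arbitrary whitespace and that a `#columns:` line appears before the first record.

```python
import io

def parse_pairs(handle):
    """Parse a pairs text stream into (header, records).

    Assumptions of this sketch: fields are whitespace-separated and the
    '#columns:' line precedes the data lines.
    """
    header = {"format": None, "chromsizes": {}, "columns": []}
    records = []
    for line in handle:
        line = line.rstrip("\n")
        if not line.strip():
            continue  # skip blank lines
        if line.startswith("##"):
            # First line, e.g. "## pairs format v1.0"
            header["format"] = line[2:].strip()
        elif line.startswith("#"):
            # Header lines have the form "#key: value"
            key, _, value = line[1:].partition(":")
            key, value = key.strip(), value.strip()
            if key == "chromsize":
                name, size = value.split()
                header["chromsizes"][name] = int(size)
            elif key == "columns":
                header["columns"] = value.split()
            else:
                header[key] = value  # e.g. "sorted", "shape"
        else:
            # Data line: map each field to its column name
            record = dict(zip(header["columns"], line.split()))
            for col in ("pos1", "pos2"):
                if col in record:
                    record[col] = int(record[col])
            records.append(record)
    return header, records

# A shortened copy of the example file above
example = """## pairs format v1.0
#sorted: chr1-chr2-pos1-pos2
#shape: upper triangle
#chromsize: chr1 248956422
#columns: readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2
.\tchr1\t1\tchr1\t51659\t-\t-\t1\t98
.\tchr1\t1\tchr1\t509430\t-\t+\t1\t992
"""
header, records = parse_pairs(io.StringIO(example))
print(header["shape"], header["chromsizes"]["chr1"])
print(records[1]["pos2"], records[1]["strand2"])
```

Keeping the column names from the `#columns:` line, rather than hard-coding them, lets the same sketch handle files that carry extra optional columns.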

 

 

Long format

The long format is used by Juicer and corresponds directly to its merged_nodups.txt file. It is a whitespace-separated file in which each line contains the following fields:
<str1> <chr1> <pos1> <frag1> <str2> <chr2> <pos2> <frag2> <mapq1> <cigar1> <sequence1> <mapq2> <cigar2> <sequence2> <readname1> <readname2>

    • str = strand (0 for forward, anything else for reverse)
    • chr = chromosome (must be a chromosome in the genome)
    • pos = position
    • frag = restriction site fragment
    • mapq = mapping quality score
    • cigar = cigar string as reported by aligner
    • sequence = DNA sequence

If the restriction site file option is not used, frag is ignored, but see the note on dummy values in the source documentation. If the mapping quality filter is not used, mapq is ignored. readname, strand, cigar, and sequence are also not currently stored within .hic files.
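The field list above can be turned into a small reader that also maps a long-format line onto the pairs columns shown earlier. This is a sketch under stated assumptions: the names `parse_long_line`, `strand_symbol`, and `long_to_pairs_row` are hypothetical, the sample line is invented for illustration, and using `readname1` as the pairs `readID` is a choice made here, not something the format prescribes.

```python
# Field order of one Juicer long-format line, as listed above
LONG_FIELDS = (
    "str1", "chr1", "pos1", "frag1",
    "str2", "chr2", "pos2", "frag2",
    "mapq1", "cigar1", "sequence1",
    "mapq2", "cigar2", "sequence2",
    "readname1", "readname2",
)
INT_FIELDS = ("pos1", "frag1", "pos2", "frag2", "mapq1", "mapq2")

def parse_long_line(line):
    """Split one whitespace-separated long-format line into a dict."""
    fields = line.split()
    if len(fields) != len(LONG_FIELDS):
        raise ValueError(
            f"expected {len(LONG_FIELDS)} fields, got {len(fields)}"
        )
    rec = dict(zip(LONG_FIELDS, fields))
    for key in INT_FIELDS:
        rec[key] = int(rec[key])
    return rec

def strand_symbol(value):
    """0 means forward; anything else is treated as reverse."""
    return "+" if value == "0" else "-"

def long_to_pairs_row(rec):
    """Build a tab-separated row in the pairs column order
    readID chr1 pos1 chr2 pos2 strand1 strand2 frag1 frag2.
    Assumption: readname1 is used as the readID."""
    return "\t".join([
        rec["readname1"], rec["chr1"], str(rec["pos1"]),
        rec["chr2"], str(rec["pos2"]),
        strand_symbol(rec["str1"]), strand_symbol(rec["str2"]),
        str(rec["frag1"]), str(rec["frag2"]),
    ])

# Invented sample line, for illustration only
sample = "0 chr1 10000 25 16 chr1 52000 130 60 4M ACGT 60 4M TGCA read1/1 read1/2"
rec = parse_long_line(sample)
row = long_to_pairs_row(rec)
print(row)
```

The strand fields are kept as strings so that any non-zero token, numeric or not, is read as reverse, matching the rule in the list above.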

 

 

REF

https://github-wiki-see.page/m/jianlin-cheng/GenomeFlow/wiki/Data-Format

 

From: https://www.cnblogs.com/emanlee/p/17294635.html
