首页 > 其他分享 >How To Use Coordinates To Extract Sequences In Fasta File

How To Use Coordinates To Extract Sequences In Fasta File

时间:2023-11-06 15:31:38浏览次数:33  
标签:Use http sequence Coordinates How ucsc test bedtools com

[1] bedtools (https://github.com/arq5x/bedtools2)

here is also bedtools (https://github.com/arq5x/bedtools2) getfasta. It uses Erik's code under the hood.

$ cat test.fa
>chr1
AAAAAAAACCCCCCCCCCCCCGCTACTGGGGGGGGGGGGGGGGGG

$ cat test.bed
chr1 5 10

$ bedtools getfasta -fi test.fa -bed test.bed -fo test.fa.out

$ cat test.fa.out
>chr1:5-10
AAACC

Docs: http://bedtools.readthedocs.org/en/latest/content/tools/getfasta.html

And it is wrapped in pybedtools as well: http://pythonhosted.org/pybedtools/autodocs/pybedtools.BedTool.sequence.html?highlight=fasta

https://code.google.com/p/bedtools/

 

[2] Samtools faidx feature

faidx samtools faidx <ref.fasta> [region1 [...]] Index reference sequence in the FASTA format or extract subsequence from indexed reference sequence. If no region is specified, faidx will index the file and create <ref.fasta>.fai on the disk. If regions are speficified, the subsequences will be retrieved and printed to stdout in the FASTA format.

You will have to first create the fasta indexes of the reference genome fasta file and then use this command.

 

[3] python implementation of faidx to GitHub.

https://github.com/mdshw5/pyfaidx

 

[4] UCSC twoBitToFa

use ucsc twoBitToFa in http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/

see also http://genome.ucsc.edu/goldenPath/help/twoBit.html

 

[5] UCSC DAS

python script to fetch sequences from UCSC DAS server:
http://genome.ucsc.edu/cgi-bin/das/h...r4:35654,35695

 

[6] ensembl biomart

 

 

 

Ref:

https://www.biostars.org/p/81087/

http://stackoverflow.com/questions/23089388/a-fast-way-to-get-human-genome-sequence-by-coordinate

http://seqanswers.com/forums/showthread.php?t=42463



标签:Use,http,sequence,Coordinates,How,ucsc,test,bedtools,com
From: https://blog.51cto.com/emanlee/8213089

相关文章

  • Docker部署ShowDoc文档工具
    一、ShowDoc介绍1.ShowDoc简介ShowDoc是一个非常适合IT团队的在线API文档、技术文档工具。通过showdoc,你可以方便地使用markdown语法来书写出美观的API文档、数据字典文档、技术文档、在线excel文档等等。2.ShowDoc功能分享与导出响应式网页设计,可将项目文档分享到电脑或移动设......
  • How to search for ? (question mark) in Excel
    The?isawildcardwhichrepresentsasinglecharacter,andthe*isawildcardcharacterthatrepresentsanystringofcharacters.Whensearchingforeitherwildcardcharacter,Excelwillsimplyfindeverything,whetherornottheseactualcharactersapp......
  • How can I move a MySQL database from one server to another?
     Myfavoritewayistopipeasqldumpcommandtoasqlcommand.Youcandoalldatabasesoraspecificone.So,forinstance,mysqldump-uuser-ppasswordmyDatabase|mysql-hremoteserver-uremoteuser-premoteserverpasswordYoucandoalldatabaseswithmysq......
  • How to resize slide dimensions without resizing any objects on the slide?
    IFyouarecompetenttounzipthepptxfileandmodifytheXMLitcanbedone,theslidesizewillchangebutthepictureswillnotchange(theywillmovethoughandyouwillhavetoadjustthepositions)Unzip>lookforPPT>presentationXMLandc......
  • How do I run Adobe Flash on Chrome?
     ToenableFlashforaspecificwebsite,firstopenyourChromebrowserandtypechrome://settings/contentintheaddressbar,thenpressenter.ThenontheContentSettingspage,scrolldownandclicktheFlashbuttonMakesurethatthetoggleissetto......
  • Apache Paimon 实时数据湖 Streaming Lakehouse 的存储底座
    摘要:本文整理自阿里云开源大数据表存储团队负责人,阿里巴巴高级技术专家李劲松(之信),在StreamingLakehouseMeetup的分享。内容主要分为四个部分:流计算邂逅数据湖PaimonCDC实时入湖Paimon不止CDC入湖总结与生态一、流计算邂逅数据湖流计算1.0实时预处理流计算1.0架构截止......
  • How to prevent salinization of land
    Howtopreventsalinizationofland一.Measures:1.Itcanbeanalyzedandoperatedbylevelingtheland,improvingcultivation,fertilization,sowing,planting,intersticulture,etc.,strengthenthemanagementofChineseagriculturalenterprises,andtry......
  • CTFshow--crypto(1-5)
    crypto1. 直接倒序就可以crypto2. 在crypto中遇到一串未知字符,可以使用CTF在线工具中的JSFuck解密,网址为:http://www.hiencode.com/jsfuck.htmlcrypto3. 遇到一些奇怪的文字时可以先另存为,再查看文件,最后利用aaencode颜文字加密即可crypto4. 直接利用CTF工具中的RS......
  • ctfshow——misc入门(1)
    2打开看见IHDR发现是PNG格式直接重命名,然后用honeyview解出来 3.与第二题解法一样方法思路都是一样的4.利用010editor寻找文件头——通过对比文件头表确定文件类型,再重命名得到文件。png——文件头89504E47jpg——文件头FFD8FFBMP——文件头424DGIF——文......
  • 【Git 教程系列第 27 篇】ssh: connect to host github.com port 22: Connection refu
    https://blog.csdn.net/qq_42351033/article/details/131612279ssh:connecttohostgithub.comport22:Connectiontimedoutfatal:Couldnotreadfromremoterepo......