biostack.org Report


  • Alexa Global Ranking: #3,481,156

    Server:nginx...

    Main IP address: 123.249.15.56. Server location: Guangzhou, China. ISP: Wonten Network Ltd. TLD: org. Country code: CN

    Description: home biostack project recruitment biostack R-series servers how to use the tools provided by biostack open world for you comments posts about twitter feeds Sina Weibo archives archives select month march 2018 february 2018 december 2017 november 2017 octob...

    This report was last updated on 05-Jul-2018

Created Date:2015-12-31

Technical data for biostack.org


A GeoIP lookup provides information such as latitude, longitude, and ISP (Internet Service Provider). Our GeoIP service located the host of biostack.org: it is currently hosted in China, and its service provider is Wonten Network Ltd.

Latitude: 23.116670608521
Longitude: 113.25
Country: China (CN)
City: Guangzhou
Region: Guangdong
ISP: Wonten Network Ltd.

HTTP Header Analysis


HTTP headers are part of the HTTP protocol. The browser's request headers state what it wants and what it will accept back, and the web server (here nginx) answers with response headers such as those listed below.

Content-Length:32170
Content-Encoding:gzip
Vary:Accept-Encoding,User-Agent
Server:nginx
Connection:keep-alive
Link:; rel="https://api.w.org/"
Date:Thu, 05 Jul 2018 10:43:01 GMT
Content-Type:text/html; charset=UTF-8

DNS

soa:f1g1ns1.dnspod.net. freednsadmin.dnspod.com. 1478833065 3600 180 1209600 180
ipv4:IP:123.249.15.56
ASN:4134
OWNER:CHINANET-BACKBONE No.31,Jin-rong Street, CN
Country:CN
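
The soa line above packs seven fields into one string. As a minimal sketch, assuming the standard SOA field order (primary name server, responsible mailbox, serial, refresh, retry, expire, minimum TTL), it can be split into named values like this. The names `SoaRecord` and `parse_soa` are illustrative and not part of the report.

```python
from typing import NamedTuple


class SoaRecord(NamedTuple):
    """Fields of a DNS SOA record, in their standard order."""
    mname: str    # primary name server
    rname: str    # responsible mailbox, with '.' in place of '@'
    serial: int   # zone serial number
    refresh: int  # seconds between secondary refresh checks
    retry: int    # seconds before retrying a failed refresh
    expire: int   # seconds after which a secondary stops answering
    minimum: int  # negative-caching TTL in seconds


def parse_soa(rdata: str) -> SoaRecord:
    """Split a whitespace-separated SOA string into typed fields."""
    parts = rdata.split()
    if len(parts) != 7:
        raise ValueError(f"expected 7 SOA fields, got {len(parts)}")
    numbers = [int(p) for p in parts[2:]]
    return SoaRecord(parts[0], parts[1], *numbers)


if __name__ == "__main__":
    soa = parse_soa(
        "f1g1ns1.dnspod.net. freednsadmin.dnspod.com. "
        "1478833065 3600 180 1209600 180"
    )
    print(soa.mname, soa.refresh, soa.expire // 86400)
```

Read this way, the record asks secondaries to refresh every 3600 seconds and to expire the zone after 1209600 seconds (14 days).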

HtmlToText

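home | biostack project recruitment | biostack R-series servers | how to use the tools provided by biostack | open world for you | comments | posts | about | twitter feeds | Sina Weibo

Archives (select month): march 2018, february 2018, december 2017, november 2017, october 2017, september 2017, august 2017, july 2017, december 2016, november 2016, september 2016, july 2015, june 2015, may 2015, february 2014, august 2013, july 2013

Tags: alignment, amplicon, annotation, assembly, bioinformatics, blast, blast2go, centos7, classifier, compression, cowplot, docker, enrichment analysis, epigenomics, error correction, gene ontology, genome, ggplot2, interproscan, k-mer, linux, metagenome, metagenomics, mircobiome, nanopore, ngs, otutable-utils, pipe, pipeline, protocol, quality control, r, r graph, rna-seq, sam/bam, sequencing, sequtils, simulator, ssr, tools, tsv-utils, utils, variation, virtualbox, weeks

Meta: log in | entries rss | comments rss | wordpress.org


Taming virsorter: predicting prophages in metagenome contigs

Tools for identifying phages include phage_finder, prophinder, phast, and phispy, but this post is about virsorter. virsorter is suited to incomplete genomes, single-cell genomes, and metagenomes.

virsorter's long run time is mainly a hmmer problem. hmmer's multithreading support is poor: even when multiple threads are requested, it effectively runs on a single thread, so runs take a long time.

The fix is to replace threads with processes. Taking advantage of how hmmsearch works, split the database file into ncpu parts, submit each part as a separate job so that they run in parallel, and then merge the per-part results into the output file.

To that end, biostack implemented hmmsearch-virsorter as middleware for submitting hmmsearch jobs. It replaces virsorter's own submission mechanism, so jobs genuinely run in parallel.

    $ hmmsearch-virsorter
    program: hmmsearch-virsorter: hmm based annotation.
    version: 0.0.1
    contact: zhang lei <[email protected]>
    usage: hmmsearch-virsorter [options] <sequence> <tblout> <output>
    options:
      -c int cpu number, default: [40]
      -d str database location, default: [/biostack/database/pfam/pfam-a.hmm]

With 40 threads, prophage identification for a typical bacterial genome now finishes in about 3 minutes.

march 8th, 2018 | tags: protocol | category: howto | comments are closed


Wrapping bioinformatics data tools: sequence alignment with hmmscan-pipe

I. Preface

Why wrap hmmscan in hmmscan-pipe? hmmscan is a program in the hmmer package (latest version at the time of writing: 3.1b2). It searches protein sequences against a profile database (indexed with hmmpress) and is a widely used tool for functional annotation of amino-acid sequences. Commonly used functional databases include pfam, superfamily, dbcan, smart, and tigrfam.

The current version of hmmscan accepts --cpu to set the number of threads, but it generally does not make effective use of multiple cores, so the best practice is to split the sequences. Use fastx-utils partition to split the input into as many files as the requested thread count and run a separate process on each file. This approach also works well on a cluster.

hmmscan-pipe depends on hmmscan-utils, which supports standard input and addresses two problems:

  • hmmscan-utils domtblout: reformats hmmscan output as tab-separated text and applies a default filter: an e-value threshold of 1e-5 for aligned segments longer than 80 aa and 1e-4 for segments shorter than 80 aa.
  • hmmscan-utils resolve: resolves overlapping hmm match regions by removing alignments with large overlaps. It implements the algorithm from the article "a fast and automated solution for accurately resolving protein domain architectures" (bmc).

hmmscan-utils provides two subcommands, which address these two problems respectively:

    usage: hmmscan-utils <command> <arguments>
    supports:
      domtblout <domtblout>
      resolve <domtblout>

II. Introducing hmmscan-pipe

Command-line interface of the hmmscan-pipe mini-wrapper:

    $hmmscan-pipe
    program: hmmscan-pipe: hmm based annotation.
    version: 0.0.1
    contact: zhang lei <[email protected]>
    usage: hmmscan-pipe [options] <sequence> <project>
    options:
      -c int cpu number, default: [56]
      -e double set evalue cutoff, default: [1e-10]
      -d str database location, default: [/biostack/database/pfam/pfam-a.hmm]

The simplest way to submit a job:

    $hmmscan-pipe a189.faa pfam

This runs the alignment with the default parameters above, including the default database, "/biostack/database/pfam/pfam-a.hmm". A slightly more involved invocation:

    $hmmscan-pipe -c 30 -e 1e-5 -d /biostack/database/pfam/pfam-a.hmm a189.faa pfam

What are the benefits of wrapping?

1. 'fastx-utils partition' splits the sequences so they can be processed by multiple processes, which raises CPU utilization and effectively supports cluster mode;
2. many auxiliary operations can be added, such as converting the generated files directly into Excel documents;
3. default parameters can be tuned to your own common needs, which simplifies the command line.

III. Finally

biostack offers custom mini-wrapper services. If you are interested, contact WeChat ID: biostack

february 26th, 2018 | tags: pipe | category: howto | comments are closed


NCBI taxonomy database update: lineage and host information now provided

Analyzing metagenome data relies on NCBI taxonomy data. NCBI taxonomy provides a species tree in which every node is assigned a numeric identifier that uniquely identifies a taxon.

The NCBI taxonomy database is distributed as taxdump.tar.gz, which records node descriptions (names.dmp) and the parent-child structure of the tree (nodes.dmp). The newly released update additionally provides lineage information (rankedlineage.dmp) and host information.

In addition, NCBI no longer assigns these numeric identifiers at the strain level, so NCBI taxonomy provides the typematerial.dmp file to map species to strains.

With the new database it is easy to annotate the output of short-read classifiers. Common operations are as follows.

1. Format the database, which can generally be done with tsv-utils:

    cut -f1,5 fullnamelineage.dmp | sed 's/ $//' >fullnamelineage.db
    cut -f1,5 taxidlineage.dmp | sed 's/ $//' >taxidlineage.db
    cut -f1,3 host.dmp >host.db

2. A typical use case

Taking kraken as an example, here is how to turn its output into useful information. kraken results look like this:

    C e00552:27:hj2jyalxx:4:1101:5233:1801 435590 203 816:40 435590:21 a:31 435590:13 0:53 435590:10 0:5
    U e00552:27:hj2jyalxx:4:1101:5781:1801 0 252 0:116 a:31 0:75

Each line has five columns:

    Column 1: classification flag, C for classified and U for unclassified
    Column 2: sequence ID
    Column 3: numeric NCBI taxonomy ID of the hit, or 0 if there is no hit
    Column 4: sequence length (203 and 252 in the lines above)
    Column 5: k-mer match information; for example, 816:40 means that 40 k-mers matched taxonomy ID 816

From this we can generate a per-sequence classification list:

    e00552:27:hj2jyalxx:4:1110:28615:27257 12509 vertebrates,human \
    viruses; dsdna viruses, no rna stage; herpesvirales; herpesviridae; gammaherpesvirinae; lymphocryptovirus; human gammaherpesvirus 4;

The following command produces it:

    grep -P ^"C" gy.txt \
     | cut -f2,3 \
     | tsv-utils definition -c 2 fullnamelineage.db - \
     | tsv-utils definition -c 2 host.db - \
     >gy.taxonomy.tsv

february 24th, 2018 | tags: knowledgebase | category: uncategorized | comments are closed


Virulence factor annotation protocol: the VFDB database

I. Virulence factors

Virulence factors (see the Wikipedia virulence_factor page for details) are molecules produced by bacteria, viruses, fungi, and similar organisms that confer virulence, mainly invasiveness and toxins. They enable:

1. Colonization of the host: adhering to the host's digestive, respiratory, reproductive, and urinary tracts, the conjunctiva, and similar sites, so as not to be cleared by intestinal peristalsis, mucus secretion, or the movement of respiratory cilia
2. Immune evasion: escaping the host's immune response
3. Immunosuppression: suppressing the host's immune response
4. Entry into and exit from cells
5. Obtaining nutrition from the host

Virulence factors can be encoded on mobile genetic elements (such as plasmids, genomic islands, and phages) and spread by horizontal gene transfer, turning harmless bacteria into dangerous pathogens. Identifying virulence factors therefore usually also involves looking at genomic islands, secreted proteins, and the like.

II. VFDB, the virulence factor database of pathogenic bacteria

VFDB was developed by the Chinese Academy of Medical Sciences and is widely used for identifying virulence factor genes. It collects bacterial virulence gene sequences covering 30 genera (74 pathogens). VFDB provides the nucleotide and protein sequences of these virulence genes, so the simplest way to identify virulence genes is sequence alignment (blast).

2.1 Database preprocessing

VFDB metadata can be obtained from the sequence file and the accompanying description file:

    >vfg000676(gb|aad32411) (lef) anthrax toxin lethal factor precursor [anthrax toxin (vf0142)] [bacillus anthracis str. sterne]

Preprocessing takes the following steps:

1. Format the sequence file, keeping only the virulence gene identifier (vfg000676) and extracting the vfg000676 -> vf0142 mapping;

2. Format the sequence database;

3. Preprocess the VFDB description records:

    vf_name -> acinetobactin
    vf_fullname -> -
    bacteria -> acinetobacter baumannii
    characteristics -> -
    structure -> an iron-chelating molecule composed of equimolar quantities \
    of 2,3-dihydroxybenzoic acid (dhba), l-threonine, and n-hydroxyhistamine
    function -> high-affinity catechol-hydroxamate siderophore competing with host cells for iron
    mechanism -> -
    keyword -> iron uptake; siderophore
    vfid -> vf0467

The columns of most interest here are vfid and keyword, which need to be formatted into a vfid-to-keyword mapping;

4. Sequence alignment;

5. Keyword annotation.

2.2 VFDB annotation protocol

Putting the steps above together on the command line:

    vfdb-fmt vfdb_seta_pro.fas.gz >vfdb_seta.pep
    makeblastdb -in vfdb_seta.pep -dbtype prot -out vfdb_seta -title vfdb_seta
    blast-pipe -d vfdb_seta -c 40 a189.faa a189
    blast-utils hits -i 30 a189/align/blast.tsv | blast-utils best_hsp - >a189.vfdb.tsv
    tail -n +3 vfs.txt | tabtk cut -r -f9,8 vfs.txt >vfdb-keyword
    tail -n +3 vfs.txt | tabtk cut -r -f9,6 vfs.txt >vfdb-fuction
    zgrep '>' vfdb_seta_pro.fas.gz | vfdb-tab | cut -f1,6 >vfdb-map
    tsv-utils definition -c 2 -t "vf" vfdb-map a189.vfdb.tsv \
     | tsv-utils definition -c 3 -t "fuction" vfdb-fuction - \
     | tsv-utils definition -c 3 -t 'keyword' vfdb-keyword - >a189.vfdb-ann.tsv
    tsv-utils tsv2xlsx a189.vfdb-ann.xlsx vfdb:a189.vfdb-ann.tsv

Once the database has been formatted, only three steps are needed:

    blast-utils hits -i 30 a189/align/blast.tsv | blast-utils best_hsp - >a189.vfdb.tsv
    tsv-utils definition -c 2 -t "vf" vfdb-map a189.vfdb.tsv \
     | tsv-utils definition -c 3 -t "fuction" vfdb-fuction - \
     | tsv-utils definition -c 3 -t 'keyword' vfdb-keyword - >a189.vfdb-ann.tsv
    tsv-utils tsv2xlsx a189.vfdb-ann.xlsx vfdb:a189.vfdb-ann.tsv

february 24th, 2018 | tags: annotation , protocol | category: howto | comments are closed


Wrapping bioinformatics data tools: sequence alignment with blast-pipe

I. Preface

To make command-line bioinformatics tools easier for biologists to use, we designed a "mini-wrapper" pattern at the level of tool integration within data analysis pipelines: wrap a few commonly used steps together and use tuned default parameters wherever possible. The mini-wrappers depend on our "tool-utils" programs, such as blast-utils, fastx-utils, tsv-utils, and sam-utils, which are mostly high-performance applications written in C.

II. Introducing blast-pipe

The blast-pipe mini-wrapper handles the submission of blast sequence alignment jobs. Its command-line interface:

    $blast-pipe
    program: blast-pipe: blast submit and parse protocol.
    version: 0.0.1
    contact: zhang lei <[email protected]>
    usage: blast-pipe [options] <sequence> <project>
    options:
      -t str blast type. blastx|blastp|blastn, default [blastp], for special task, can do like this: 'blastn -task megablast'
      -c int cpu number, default: [56]
      -e double set evalue cutoff, default: [1e-10]
      -i double set identity cutoff for filter, default: [0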
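
The hmmscan-pipe post above describes a length-dependent e-value filter in hmmscan-utils domtblout: 1e-5 for aligned segments longer than 80 aa, 1e-4 for shorter ones. The rule can be sketched as below. This is an illustration, not the tool's actual code: the function name is hypothetical, and two details the post does not specify are assumptions, namely that a segment of exactly 80 aa counts as short and that a hit passes when its e-value is less than or equal to the threshold.

```python
def passes_default_filter(aligned_length_aa: int, evalue: float) -> bool:
    """Length-dependent e-value filter, as described for 'domtblout'.

    Assumptions (not stated in the post): exactly 80 aa is treated as
    short, and the comparison against the threshold is inclusive.
    """
    threshold = 1e-5 if aligned_length_aa > 80 else 1e-4
    return evalue <= threshold


if __name__ == "__main__":
    # (aligned length in aa, e-value) pairs made up for illustration.
    hits = [(120, 1e-6), (120, 5e-5), (60, 5e-5), (60, 1e-3)]
    for length, evalue in hits:
        print(length, evalue, passes_default_filter(length, evalue))
```

The looser threshold for short segments reflects that short domains produce weaker e-values even when the match is genuine.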

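The kraken output lines shown in the NCBI taxonomy post above can be read with a few lines of Python. The sketch below assumes kraken's tab-separated layout with five fields per line (flag, sequence ID, taxonomy ID, sequence length, k-mer matches) and single-end reads, as in the sample. `KrakenRead` and `parse_kraken_line` are illustrative names, and the flag is compared case-insensitively so that lowercased copies of the output also parse.

```python
from typing import List, NamedTuple, Tuple


class KrakenRead(NamedTuple):
    classified: bool
    read_id: str
    taxid: int                       # 0 when the read is unclassified
    length: int                      # sequence length in bp
    kmer_hits: List[Tuple[str, int]] # (taxon label, k-mer count) pairs


def parse_kraken_line(line: str) -> KrakenRead:
    """Parse one tab-separated kraken output line (single-end layout)."""
    flag, read_id, taxid, length, lca = line.rstrip("\n").split("\t", 4)
    hits = []
    for token in lca.split():
        # The taxon label stays a string because non-numeric labels
        # (for example 'A' in the sample) can appear here.
        taxon, count = token.split(":")
        hits.append((taxon, int(count)))
    return KrakenRead(flag.upper() == "C", read_id, int(taxid), int(length), hits)


if __name__ == "__main__":
    sample = "\t".join([
        "C",
        "e00552:27:hj2jyalxx:4:1101:5233:1801",
        "435590",
        "203",
        "816:40 435590:21 A:31 435590:13 0:53 435590:10 0:5",
    ])
    read = parse_kraken_line(sample)
    print(read.classified, read.taxid, read.length)
    print(sum(count for _, count in read.kmer_hits))
```

Keeping only reads with `classified` set and emitting `read_id` and `taxid` mirrors the `grep` and `cut -f2,3` steps of the shell pipeline in the post.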
URL analysis for biostack.org


http://www.biostack.org/?tag=interproscan
http://www.biostack.org/?p=725
http://www.biostack.org/?p=698
http://www.biostack.org/?page_id=644
http://www.biostack.org/?tag=weeks
http://www.biostack.org/?tag=enrichment-analysis
http://www.biostack.org/?tag=classifier
http://www.biostack.org/?p=677
http://www.biostack.org/?p=729
http://www.biostack.org/?p=694
http://www.biostack.org/?tag=sambam
http://www.biostack.org/?p=719
http://www.biostack.org/?tag=docker
http://www.biostack.org/?tag=utils
http://www.biostack.org/?tag=pipe

Whois Information


Whois is a protocol for accessing domain registration information. It shows when the website was registered, when the registration will expire, and the contact details for the site. In short, it includes the following information:

Domain Name: BIOSTACK.ORG
Registry Domain ID: D179020149-LROR
Registrar WHOIS Server: whois.publicdomainregistry.com
Registrar URL: http://www.publicdomainregistry.com
Updated Date: 2016-11-10T11:32:06Z
Creation Date: 2015-12-31T03:04:52Z
Registry Expiry Date: 2020-12-31T03:04:52Z
Registrar Registration Expiration Date:
Registrar: PDR Ltd. d/b/a PublicDomainRegistry.com
Registrar IANA ID: 303
Registrar Abuse Contact Email: [email protected]
Registrar Abuse Contact Phone: +1.2013775952
Reseller:
Domain Status: clientTransferProhibited https://icann.org/epp#clientTransferProhibited
Registry Registrant ID: C171090177-LROR
Registrant Name: lei ZHANG
Registrant Organization: shanghailuojiexinxikejiyouxiangongsi
Registrant Street: shanghai
Registrant Street: address2
Registrant City: shanghai
Registrant State/Province: shanghai
Registrant Postal Code: 200000
Registrant Country: CN
Registrant Phone: +86.13611933136
Registrant Phone Ext:
Registrant Fax:
Registrant Fax Ext:
Registrant Email: [email protected]
Registry Admin ID: C171090177-LROR
Admin Name: lei ZHANG
Admin Organization: shanghailuojiexinxikejiyouxiangongsi
Admin Street: shanghai
Admin Street: address2
Admin City: shanghai
Admin State/Province: shanghai
Admin Postal Code: 200000
Admin Country: CN
Admin Phone: +86.13611933136
Admin Phone Ext:
Admin Fax:
Admin Fax Ext:
Admin Email: [email protected]
Registry Tech ID: C171090177-LROR
Tech Name: lei ZHANG
Tech Organization: shanghailuojiexinxikejiyouxiangongsi
Tech Street: shanghai
Tech Street: address2
Tech City: shanghai
Tech State/Province: shanghai
Tech Postal Code: 200000
Tech Country: CN
Tech Phone: +86.13611933136
Tech Phone Ext:
Tech Fax:
Tech Fax Ext:
Tech Email: [email protected]
Name Server: F1G1NS1.DNSPOD.NET
Name Server: F1G1NS2.DNSPOD.NET
DNSSEC: unsigned
URL of the ICANN Whois Inaccuracy Complaint Form: https://www.icann.org/wicf/
>>> Last update of WHOIS database: 2018-04-17T20:26:44Z <<<

For more information on Whois status codes, please visit https://icann.org/epp

Access to Public Interest Registry WHOIS information is provided to assist persons in determining the contents of a domain name registration record in the Public Interest Registry registry database. The data in this record is provided by Public Interest Registry for informational purposes only, and Public Interest Registry does not guarantee its accuracy. This service is intended only for query-based access. You agree that you will use this data only for lawful purposes and that, under no circumstances will you use this data to: (a) allow, enable, or otherwise support the transmission by e-mail, telephone, or facsimile of mass unsolicited, commercial advertising or solicitations to entities other than the data recipient's own existing customers; or (b) enable high volume, automated, electronic processes that send queries or data to the systems of Registry Operator, a Registrar, or Afilias except as reasonably necessary to register domain names or modify existing registrations. All rights reserved. Public Interest Registry reserves the right to modify these terms at any time. By submitting this query, you agree to abide by this policy.

  REFERRER http://www.pir.org/

  REGISTRAR Public Interest Registry

SERVERS

  SERVER org.whois-servers.net

  ARGS biostack.org

  PORT 43

  TYPE domain

DOMAIN

  NAME biostack.org

  HANDLE D179020149-LROR

  CREATED 2015-12-31

STATUS
clientTransferProhibited https://icann.org/epp#clientTransferProhibited

NSERVER

  F1G1NS1.DNSPOD.NET 58.247.212.36

  F1G1NS2.DNSPOD.NET 101.226.220.16

OWNER

  HANDLE C171090177-LROR

  NAME lei ZHANG

  ORGANIZATION shanghailuojiexinxikejiyouxiangongsi

ADDRESS

STREET
shanghai
address2

  CITY shanghai

  STATE shanghai

  PCODE 200000

  COUNTRY CN

  PHONE +86.13611933136

  EMAIL [email protected]

ADMIN

  HANDLE C171090177-LROR

  NAME lei ZHANG

  ORGANIZATION shanghailuojiexinxikejiyouxiangongsi

ADDRESS

STREET
shanghai
address2

  CITY shanghai

  STATE shanghai

  PCODE 200000

  COUNTRY CN

  PHONE +86.13611933136

  EMAIL [email protected]

TECH

  HANDLE C171090177-LROR

  NAME lei ZHANG

  ORGANIZATION shanghailuojiexinxikejiyouxiangongsi

ADDRESS

STREET
shanghai
address2

  CITY shanghai

  STATE shanghai

  PCODE 200000

  COUNTRY CN

  PHONE +86.13611933136

  EMAIL [email protected]

  REGISTERED yes


Mistakes


The following list shows possible misspellings that internet users might type when looking for this website.

  • www.ubiostack.com
  • www.7biostack.com
  • www.hbiostack.com
  • www.kbiostack.com
  • www.jbiostack.com
  • www.ibiostack.com
  • www.8biostack.com
  • www.ybiostack.com
  • www.biostackebc.com
  • www.biostackebc.com
  • www.biostack3bc.com
  • www.biostackwbc.com
  • www.biostacksbc.com
  • www.biostack#bc.com
  • www.biostackdbc.com
  • www.biostackfbc.com
  • www.biostack&bc.com
  • www.biostackrbc.com
  • www.biostack4bc.com
  • www.biostackc.com
  • www.biostackbc.com
  • www.biostackvc.com
  • www.biostackvbc.com
  • www.biostackvc.com
  • www.biostack c.com
  • www.biostack bc.com
  • www.biostack c.com
  • www.biostackgc.com
  • www.biostackgbc.com
  • www.biostackgc.com
  • www.biostackjc.com
  • www.biostackjbc.com
  • www.biostackjc.com
  • www.biostacknc.com
  • www.biostacknbc.com
  • www.biostacknc.com
  • www.biostackhc.com
  • www.biostackhbc.com
  • www.biostackhc.com
  • www.biostack.com
  • www.biostackc.com
  • www.biostackx.com
  • www.biostackxc.com
  • www.biostackx.com
  • www.biostackf.com
  • www.biostackfc.com
  • www.biostackf.com
  • www.biostackv.com
  • www.biostackvc.com
  • www.biostackv.com
  • www.biostackd.com
  • www.biostackdc.com
  • www.biostackd.com
  • www.biostackcb.com
  • www.biostackcom
  • www.biostack..com
  • www.biostack/com
  • www.biostack/.com
  • www.biostack./com
  • www.biostackncom
  • www.biostackn.com
  • www.biostack.ncom
  • www.biostack;com
  • www.biostack;.com
  • www.biostack.;com
  • www.biostacklcom
  • www.biostackl.com
  • www.biostack.lcom
  • www.biostack com
  • www.biostack .com
  • www.biostack. com
  • www.biostack,com
  • www.biostack,.com
  • www.biostack.,com
  • www.biostackmcom
  • www.biostackm.com
  • www.biostack.mcom
  • www.biostack.ccom
  • www.biostack.om
  • www.biostack.ccom
  • www.biostack.xom
  • www.biostack.xcom
  • www.biostack.cxom
  • www.biostack.fom
  • www.biostack.fcom
  • www.biostack.cfom
  • www.biostack.vom
  • www.biostack.vcom
  • www.biostack.cvom
  • www.biostack.dom
  • www.biostack.dcom
  • www.biostack.cdom
  • www.biostackc.om
  • www.biostack.cm
  • www.biostack.coom
  • www.biostack.cpm
  • www.biostack.cpom
  • www.biostack.copm
  • www.biostack.cim
  • www.biostack.ciom
  • www.biostack.coim
  • www.biostack.ckm
  • www.biostack.ckom
  • www.biostack.cokm
  • www.biostack.clm
  • www.biostack.clom
  • www.biostack.colm
  • www.biostack.c0m
  • www.biostack.c0om
  • www.biostack.co0m
  • www.biostack.c:m
  • www.biostack.c:om
  • www.biostack.co:m
  • www.biostack.c9m
  • www.biostack.c9om
  • www.biostack.co9m
  • www.biostack.ocm
  • www.biostack.co
  • biostack.orgm
  • www.biostack.con
  • www.biostack.conm
  • biostack.orgn
  • www.biostack.col
  • www.biostack.colm
  • biostack.orgl
  • www.biostack.co
  • www.biostack.co m
  • biostack.org
  • www.biostack.cok
  • www.biostack.cokm
  • biostack.orgk
  • www.biostack.co,
  • www.biostack.co,m
  • biostack.org,
  • www.biostack.coj
  • www.biostack.cojm
  • biostack.orgj
  • www.biostack.cmo