[ Source: pinfish ]

Package: pinfish (0.1.0+ds-3)

Collection of tools to annotate genomes using long read transcriptomics data

The toolchain is composed of the following tools:

1. spliced_bam2gff - a tool for converting sorted BAM files containing spliced alignments into GFF2 format. Each read is represented as a distinct transcript. This tool comes in handy when visualizing spliced reads at particular loci and for providing input to the rest of the toolchain.

2. cluster_gff - this tool takes a sorted GFF2 file as input, clusters together reads with similar exon/intron structure, and creates a rough consensus of each cluster by taking the median of the exon boundaries from all transcripts in the cluster.

3. polish_clusters - this tool takes the cluster definitions generated by cluster_gff and, for each cluster, creates an error-corrected read by mapping all reads onto the read with the median length and polishing it using racon. The polished reads can then be mapped to the genome using minimap2 or GMAP.

4. collapse_partials - this tool takes GFFs generated by either cluster_gff or polish_clusters and filters out transcripts which are likely to be based on RNA degradation products from the 5' end. The tool clusters the input transcripts into "loci" by their 3' ends and discards transcripts which have a compatible transcript in the same locus with more exons.
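
The two "median" steps mentioned above (the consensus of exon boundaries in cluster_gff and the choice of the median-length read in polish_clusters) can be illustrated with a minimal Python sketch. This is not pinfish's implementation: the function names `consensus_transcript` and `median_length_read` are hypothetical, the sketch assumes every transcript in a cluster has the same number of exons, and it uses the low median so that results are always observed values, which may differ from what the real tools do.

```python
from statistics import median_low

# Hypothetical data model: an exon is a (start, end) pair and a
# transcript is a list of exons ordered along the genome.
Exon = tuple[int, int]
Transcript = list[Exon]


def consensus_transcript(cluster: list[Transcript]) -> Transcript:
    """Build a rough consensus by taking the median of each exon boundary.

    Assumption: all transcripts in the cluster have the same exon count.
    """
    if not cluster:
        raise ValueError("cluster must contain at least one transcript")
    n_exons = len(cluster[0])
    if any(len(t) != n_exons for t in cluster):
        raise ValueError("all transcripts must have the same exon count")

    consensus: Transcript = []
    for i in range(n_exons):
        # Low median: always one of the observed coordinates.
        start = median_low([t[i][0] for t in cluster])
        end = median_low([t[i][1] for t in cluster])
        consensus.append((start, end))
    return consensus


def median_length_read(reads: dict[str, str]) -> str:
    """Return the name of the read whose length is the (low) median.

    Ties in length are broken by read name to keep the choice stable.
    """
    if not reads:
        raise ValueError("reads must not be empty")
    ordered = sorted(reads, key=lambda name: (len(reads[name]), name))
    return ordered[(len(ordered) - 1) // 2]


if __name__ == "__main__":
    cluster = [
        [(100, 200), (300, 400)],
        [(102, 200), (300, 405)],
        [(98, 201), (299, 400)],
    ]
    print(consensus_transcript(cluster))
    print(median_length_read({"r1": "ACGT", "r2": "ACGTACGT", "r3": "AC"}))
```

In the real toolchain the selected read would then serve as the target that the other reads in the cluster are mapped onto before polishing with racon; the sketch only covers the selection step.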

Other packages related to pinfish

  • depends
  • recommends
  • suggests
  • dep: libc6 (>= 2.34)
    GNU C Library: Shared libraries
    also a virtual package provided by libc6-udeb
  • dep: minimap2
    versatile pairwise aligner for genomic and spliced nucleotide sequences
  • dep: racon
    consensus module for raw de novo DNA assembly of long uncorrected reads

Download pinfish

Download for all available architectures
Architecture  Package Size  Installed Size  Files
amd64         1,300.9 kB    7362 kB         [list of files]
arm64         1,205.8 kB    7409 kB         [list of files]
armhf         1,219.1 kB    6882 kB         [list of files]
ppc64el       1,191.1 kB    7578 kB         [list of files]