PipeOne integrates multi-modal data from RNA-seq to identify disease features and clinically-relevant subtypes

Documentation Status

run with conda run with docker

Overview

RNA sequencing (RNA-seq) represents one of the most widely-used technologies to investigate the transcriptome, and various algorithms have been developed to analyze RNA-seq data. However, a workflow integrating multi-modal information from these algorithms to study sequenced samples comprehensively is still lacking. Here, we present PipeOne, a cross-platform one-stop analysis workflow for a large number of RNA-seq samples. It includes three modules, data preprocessing and feature matrices construction by combining eight RNA analysis tools, disease feature prioritization by machine learning, and disease subtyping by clustering and survival analysis. PipeOne can be easily applied to other cancer types and complex diseases and extended to analyzing other types of high-throughput data. PipeOne is freely available at https://github.com/nongbaoting/PipeOne.


Module 1: RNA-seq Processing


Module 2: Feature Prioritization


Module 3: Subtype Analysis