Computational power of transcriptome analysis
transcriptome
refers to the collection of all mRNA procts transcribed in a certain species or specific cell in a certain physiological function state, including the limitation of time
and space
and is the inevitable link between
genome
genetic information and biological function
proteome
. The regulation of transcription level is the most studied and the most important regulation mode of organism
transcriptome sequencing using high-throughput technology is a fast and reliable method to obtain transcriptome information. Transcriptional expression analysis of mRNA, through obtaining the information of the genome transcription region of the research object, identifying the transcription
sites
, variable splicing, etc., its accurate counting method can be used for accurate quantitative analysis of genes.
I don't know much about the chip. It may be a closed system. At present, it's not very good for finding new transcripts
for transcriptome sequencing, the contaminated sequences were removed first, and then the fragments overlapped. Then each read is mapped to the genome to obtain go value, functional annotation, transcriptional level evaluation, functional enrichment and pathway analysis. If you are interested in the newly discovered transcripts, you can also make functional prediction and cellular localization of transcripts. I hope there are experts to evaluate it. If it's wrong, just point it out. OVER.
short sequence mapping to reference genome sequence
gene function annotation
gene structure analysis
variable cutting analysis
gene expression difference analysis (no less than two samples
predict new genes)
in addition, NCBI also has the concept of UniGene. NCBI uses its own method to cluster sequences. It thinks that each class represents a unique gene, and gives a number: the name of the species. For example, an example of wheat: ta.191, which is also a commonly used UniGene
1. Statistics and truth: how to use contingency
2. Google analytical classic analysis
3. Statistics: from data to conclusion (Wu Xi) 2nd Edition
4. Statistical data standardization method
several supplementary books:
1 The world of statistics
2. The truth of people's livelihood data
3. Statistical traps
They represent different aspects of function as follows:
C (cellular component): subcellular localization of gene (non) coding procts
F (molecular function): molecular activity of gene procts
P (biological process): biological process and pathway composed of multiple gene procts
reference: Web links
Transcriptome analysis refers to the analysis of all transcripts in a cell
in a broad sense, transcriptome refers to the collection of all transcripts in a cell under a certain physiological condition, including messenger RNA, ribosomal RNA, transport RNA and non coding RNA; In a narrow sense, it refers to the collection of all mRNA
transcriptome sequencing generally involves high-throughput sequencing of mature mRNA and ncRNA transcribed by RNA polymerase II purified with oligothymine (oligo DT)
compared with the traditional microarray hybridization platform, transcriptome sequencing can detect the overall transcriptional activity of any species without designing probes for known sequences in advance, providing more accurate digital signals, higher detection throughput and wider detection range, which is a powerful tool for in-depth study of transcriptome complexity
{rrrrrrr}
extended data:
technical route of transcriptome sequencing:
sample requirements:
1. Sample purity requirements: the total rnaod value should be between 1.8 and 2.2; The ratio of 28s to 18S was more than 1.5
The concentration of total RNA was not less than 400 ng / UL; The total amount of sample is not less than 15ug; At present, the requirement of the latest sample library is reced to 1 UG, and the concentration is more than 50 ng / UL The specific concentration, volume, preparation time, solvent name and species source of total RNA samples were provided. Please also attach QC data, including electrophoretic gel chart, spectrophotometer or nanodrop instrument test data. If multiple sample preparation is required, the samples required for multiple sample preparation shall be provided