TY - JOUR
T1 - Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs
AU - Takeda, Jun Ichi
AU - Suzuki, Yutaka
AU - Nakao, Mitsuteru
AU - Barrero, Roberto A.
AU - Koyanagi, Kanako O.
AU - Jin, Lihua
AU - Motono, Chie
AU - Hata, Hiroko
AU - Isogai, Takao
AU - Nagai, Keiichi
AU - Otsuki, Tetsuji
AU - Kuryshev, Vladimir
AU - Shionyu, Masafumi
AU - Yura, Kei
AU - Go, Mitiko
AU - Thierry-Mieg, Jean
AU - Thierry-Mieg, Danielle
AU - Wiemann, Stefan
AU - Nomura, Nobuo
AU - Sugano, Sumio
AU - Gojobori, Takashi
AU - Iman ishi, Tadashi
PY - 2006
Y1 - 2006
N2 - We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.
AB - We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.
UR - http://www.scopus.com/inward/record.url?scp=33748550313&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33748550313&partnerID=8YFLogxK
U2 - 10.1093/nar/gkl507
DO - 10.1093/nar/gkl507
M3 - Article
C2 - 16914452
AN - SCOPUS:33748550313
SN - 0305-1048
VL - 34
SP - 3917
EP - 3928
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 14
ER -