J. P. Albuquerque, V. Tobias-Santos, A. C. Rodrigues, F. B. Mury, and R. N. da Fonseca, Small ORFs: a new class of essential genes for development, Genet. Mol. Biol, vol.38, pp.278-283, 2015.

V. Delcourt, A. Staskevicius, M. Salzet, I. Fournier, and X. Roucou, Small proteins encoded by unannotated ORFs are rising stars of the proteome, confirming shortcomings in genome annotations and current vision of an mRNA, Proteomics, vol.18, p.1700058, 2018.
URL : https://hal.archives-ouvertes.fr/inserm-02940662

R. P. Hellens, C. M. Brown, M. A. Chisnall, P. M. Waterhouse, and R. C. Macknight, The emerging world of small ORFs, Trends Plant Sci, vol.21, pp.317-328, 2016.

A. Saghatelian and J. P. Couso, Discovery and characterization of smORF-encoded bioactive polypeptides, Nat. Chem. Biol, vol.11, pp.909-916, 2015.

J. I. Pueyo, E. G. Magny, and J. P. Couso, New peptides under the s(ORF)ace of the genome, Trends Biochem. Sci, vol.41, pp.665-678, 2016.

M. A. Brunet, S. A. Levesque, D. J. Hunting, A. A. Cohen, and X. Roucou, Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship, Genome Res, vol.28, pp.609-624, 2018.

S. Samandi, A. V. Roy, V. Delcourt, J. Lucier, J. Gagnon et al., Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins, eLife, vol.6, p.e27860, 2017.

S. J. Andrews and J. A. Rothnagel, Emerging evidence for functional peptides encoded by short open reading frames, Nat. Rev. Genet, vol.15, pp.193-204, 2014.

A. Matsumoto, J. G. Clohessy, and P. P. Pandolfi, SPAR, a lncRNA encoded mTORC1 inhibitor, Cell Cycle, vol.16, pp.815-816, 2017.

A. Pauli, M. L. Norris, E. Valen, G. Chew, J. A. Gagnon et al., Toddler: an embryonic signal that promotes cell movement via Apelin receptors, Science, vol.343, p.1248636, 2014.

D. M. Anderson, K. M. Anderson, C. Chang, C. A. Makarewich, B. R. Nelson et al., A micropeptide encoded by a putative long noncoding RNA regulates muscle performance, Cell, vol.160, pp.595-606, 2015.

N. T. Ingolia, Ribosome profiling: new views of translation, from single codons to genome scale, Nat. Rev. Genet, vol.15, pp.205-213, 2014.

N. T. Ingolia, Ribosome footprint profiling of translation throughout the genome, Cell, vol.165, p.22, 2016.

B. Vanderperre, A. B. Staskevicius, G. Tremblay, M. McCoy, M. A. O'Neill et al., An overlapping reading frame in the PRNP gene encodes a novel polypeptide distinct from the prion protein, FASEB J, vol.25, pp.2373-2386, 2011.

D. Bergeron, C. Lapointe, C. Bissonnette, G. Tremblay, J. Motard et al., An out-of-frame overlapping reading frame in the ataxin-1 coding sequence encodes a novel ataxin-1 interacting protein, J. Biol. Chem, vol.288, pp.21824-21835, 2013.

K. J. Autio, A. J. Kastaniotis, H. Pospiech, I. J. Miinalainen, M. S. Schonauer et al., An ancient genetic link between vertebrate mitochondrial fatty acid synthesis and RNA processing, FASEB J, vol.22, pp.569-578, 2008.

D. E. Andreev, P. B. O'Connor, C. Fahey, E. M. Kenny, I. M. Terenin et al., Translation of 5′ leaders is pervasive in genes resistant to eIF2 repression, eLife, vol.4, p.e03971, 2015.

V. Olexiouk, W. Van Criekinge, and G. Menschaert, An update on sORFs.org: a repository of small ORFs identified by ribosome profiling, Nucleic Acids Res, vol.46, pp.497-502, 2018.

Y. Hao, L. Zhang, Y. Niu, T. Cai, J. Luo et al., SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci, Brief. Bioinform, vol.19, pp.636-643, 2018.

P. Y. Hsu and P. N. Benfey, Small but Mighty: Functional peptides encoded by small ORFs in plants, Proteomics, vol.18, p.1700038, 2018.

J. Ma, J. K. Diedrich, I. Jungreis, C. Donaldson, J. Vaughan et al., Improved identification and analysis of small open reading frame encoded polypeptides, Anal. Chem, vol.88, pp.3967-3975, 2016.

A. I. Nesvizhskii, Proteogenomics: concepts, applications and computational strategies, Nat. Methods, vol.11, pp.1114-1125, 2014.

V. Olexiouk and G. Menschaert, Identification of small novel coding sequences, a proteogenomics endeavor, Adv. Exp. Med. Biol, vol.926, pp.49-64, 2016.

J. C. Wright, J. Mudge, H. Weisser, M. P. Barzine, J. M. Gonzalez et al., Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow, Nat. Commun, vol.7, p.11778, 2016.

N. A. O'Leary, M. W. Wright, J. R. Brister, S. Ciufo, D. Haddad et al., Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation, Nucleic Acids Res, vol.44, pp.733-745, 2016.

D. R. Zerbino, P. Achuthan, W. Akanni, M. R. Amode, D. Barrell et al., Ensembl 2018, Nucleic Acids Res, vol.46, pp.754-761, 2018.

P. Wu, J. H. Phan, and M. D. Wang, Assessing the impact of human genome annotation choice on RNA-seq expression estimates, BMC Bioinformatics, vol.14, p.8, 2013.

W. Li, A. Cowley, M. Uludag, T. Gur, H. McWilliam et al., The EMBL-EBI bioinformatics web and programmatic tools framework, Nucleic Acids Res, vol.43, pp.580-584, 2015.

A. Bateman, M. J. Martin, C. O'Donovan, M. Magrane, E. Alpi et al., UniProt: the universal protein knowledgebase, Nucleic Acids Res, vol.45, pp.158-169, 2017.

S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, Basic local alignment search tool, J. Mol. Biol, vol.215, pp.403-410, 1990.

E. W. Deutsch, A. Csordas, Z. Sun, A. Jarnuczak, Y. Perez-Riverol et al., The ProteomeXchange consortium in 2017: supporting the cultural change in proteomics public data deposition, Nucleic Acids Res, vol.45, pp.1100-1106, 2017.

J. A. Vizcaíno, A. Csordas, N. Del-toro, J. A. Dianes, J. Griss et al., 2016 update of the PRIDE database and its related tools, Nucleic Acids Res, vol.44, pp.447-456, 2016.

M. Vaudel, J. M. Burkhart, R. P. Zahedi, E. Oveland, F. S. Berven et al., PeptideShaker enables reanalysis of MS-derived proteomics data sets, Nat. Biotechnol, vol.33, pp.22-24, 2015.

M. Vaudel, H. Barsnes, F. S. Berven, A. Sickmann, and L. Martens, SearchGUI: An open-source graphical user interface for simultaneous OMSSA and X!Tandem searches, Proteomics, vol.11, pp.996-999, 2011.

F. Erhard, A. Halenius, C. Zimmermann, A. L'Hernault, D. J. Kowalewski et al., Improved Ribo-seq enables identification of cryptic translation events, Nat. Methods, vol.15, pp.363-366, 2018.

E. L. Sonnhammer and G. Östlund, InParanoid 8: orthology analysis between 273 proteomes, mostly eukaryotic, Nucleic Acids Res, vol.43, pp.234-239, 2015.

G. Östlund, T. Schmitt, K. Forslund, T. Köstler, D. N. Messina et al., InParanoid 7: new algorithms and tools for eukaryotic orthology analysis, Nucleic Acids Res, vol.38, pp.196-203, 2010.

R. D. Finn, T. K. Attwood, P. C. Babbitt, A. Bateman, P. Bork et al., InterPro in 2017--beyond protein family and domain annotations, Nucleic Acids Res, vol.45, pp.190-199, 2017.

M. Kozak, Pushing the limits of the scanning mechanism for initiation of translation, Gene, vol.299, pp.1-34, 2002.

W. L. Noderer, R. J. Flockhart, A. Bhaduri, A. J. Diaz-de-arce, J. Zhang et al., Quantitative analysis of mammalian translation initiation sites by FACS-seq, Mol. Syst. Biol, vol.10, p.748, 2014.

M. D. Wilkinson, M. Dumontier, I. J. Aalbersberg, G. Appleton, M. Axton et al., The FAIR guiding principles for scientific data management and stewardship, Sci. Data, vol.3, p.160018, 2016.