. Fig, Mappings of 2046 genomic signatures Upper panels: signatures as mapped by, SOM (35X55 nodes)

. Sammon, CCA, and DD-HDS using Euclidean metric, and derived metrics or pseudo-metrics (rank order and geodesic (connectivity=5)); points correspond to species, grey levels code for taxonomy: light gray = Eukaryotes, dark gray = bacteria, black = archebacteria. Lower panels: Highlights of the subgroup of actinobacteria for some of the above mappings (others species are in grey)

D. L. Donoho, High-Dimensional Data Analysis: The Curses and Blessings of DimensionalityMath Challenges of the 21st Century Available from http, Los Angeles, 2000.

T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning. Data mining, inference, and Prediction (pp. 470), 2001.

W. Basalaj, Proximity visualization of abstract data, 2000.

W. S. Torgerson, Multidimensional scaling: I. Theory and method, Psychometrika, vol.3, issue.4, pp.401-419, 1952.
DOI : 10.1007/BF02288916

J. W. Sammon, A Nonlinear Mapping for Data Structure Analysis, CC-18, pp.401-409, 1969.
DOI : 10.1109/T-C.1969.222678

P. Demartines and J. Hérault, Curvilinear component analysis: a self-organizing neural network for nonlinear mapping of data sets, IEEE Transactions on Neural Networks, vol.8, issue.1, pp.148-154, 1997.
DOI : 10.1109/72.554199

J. Hérault, A. Guérin-dugué, and P. Villemain, Searching for the embedded manifolds in high-dimensional data, problems and unsolved questions, Proceedings of the European Symposium on Artificial Neural Networks (ESANN), pp.173-184, 2002.

C. C. Aggarwal, A. Hinneburg, and D. A. Keim, On the Surprising Behavior of Distance Metrics in High Dimensional Space, Proceedings of the 8th international Conference on Database theory J. V. Bussche and V. Vianu, pp.420-434, 2001.
DOI : 10.1007/3-540-44503-X_27

D. Lowe and M. E. Tipping, Feed-forward neural networks and topographic mappings for exploratory data analysis, Neural Computing & Applications, vol.158, issue.2, pp.83-95, 1996.
DOI : 10.1007/BF01413744

B. Schölkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, 2002.

B. Schölkopf, A. J. Smola, and K. R. Müller, Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.20, issue.5, pp.1299-1319, 1998.
DOI : 10.1007/BF02281970

J. Shawe-taylor and N. Cristianini, Kernel Methods for Pattern Analysis, 2004.
DOI : 10.1017/CBO9780511809682

W. Wu, D. L. Massarat, S. De, and J. , Kernel-PCA algorithms for wide data Part II: Fast cross-validation and application in classification of NIR data, Chemometrics and Intelligent Laboratory Systems, vol.37, issue.2, pp.271-280, 1997.
DOI : 10.1016/S0169-7439(97)00027-0

C. K. Williams and M. Seeger, Using the Nyström method to speed up kernel machines, Advances in Neural Information Processing Systems, pp.682-688, 2001.

K. Q. Weinberger, F. Sha, and L. K. Saul, Learning a kernel matrix for nonlinear dimensionality reduction, Twenty-first international conference on Machine learning , ICML '04, pp.839-846, 2004.
DOI : 10.1145/1015330.1015345

K. Q. Weinberger, B. D. Packer, and L. K. Saul, Nonlinear dimensionality reduction by semidefinite programming and kernel matrix factorization, Proceedings of the 30th International Workshop on Artificial Intelligence and Statistics, pp.381-388, 2005.

J. Sun, S. Boyd, L. Xiao, and P. Diaconis, The Fastest Mixing Markov Process on a Graph and a Connection to a Maximum Variance Unfolding Problem, SIAM Review, vol.48, issue.4, 2004.
DOI : 10.1137/S0036144504443821

J. B. Tenenbaum, V. De-silva, and J. C. Langford, A Global Geometric Framework for Nonlinear Dimensionality Reduction, Science, vol.290, issue.5500, pp.2319-2323, 2000.
DOI : 10.1126/science.290.5500.2319

J. A. Lee, A. Lendasse, and M. Verleysen, Nonlinear projection with curvilinear distances: Isomap versus curvilinear distance analysis, Neurocomputing, vol.57, pp.49-76, 2004.
DOI : 10.1016/j.neucom.2004.01.007

E. W. Dijkstra, A note on two problems in connexion with graphs, Numerische Mathematik, vol.4, issue.1, pp.269-271, 1959.
DOI : 10.1007/BF01386390

S. Skiena, Implementing Discrete Mathematics: Combinatorics and Graph Theory with Mathematica, 1990.

J. A. Lee, N. Lendasse, N. Donckers, and M. Verleysen, A robust non-linear projection method, Proceedings of the European Symposium on Artificial Neural Networks (ESANN), pp.13-20, 2000.

J. A. Lee and M. Verleysen, Nonlinear dimensionality reduction of data manifolds with essential loops, Neural Networks and Learning, pp.29-53, 2005.
DOI : 10.1016/j.neucom.2004.11.042

P. J. Deschavanne, A. Giron, J. Vilain, G. Fagot, and B. Fertil, Genomic signature: characterization and classification of species assessed by chaos game representation of sequences, Molecular Biology and Evolution, vol.16, issue.10, pp.1391-1399, 1999.
DOI : 10.1093/oxfordjournals.molbev.a026048

C. Dufraigne, B. Fertil, S. Lespinats, A. Giron, and P. Deschavanne, Detection and characterization of horizontal transfers in prokaryotes using genomic signature, Nucleic Acids Research, vol.33, issue.1, p.6, 2005.
DOI : 10.1093/nar/gni004

I. Borg and P. Groenen, Modern Multidimensional Scaling, 1997.
DOI : 10.1007/978-1-4757-2711-1

S. T. Roweis and L. K. Saul, Nonlinear Dimensionality Reduction by Locally Linear Embedding, Science, vol.290, issue.5500, pp.2323-2326, 2000.
DOI : 10.1126/science.290.5500.2323

M. Belkin and P. Niyogi, Laplacian Eigenmaps for Dimensionality Reduction and Data Representation, Neural Computation, vol.15, issue.6, pp.1373-1396, 2003.
DOI : 10.1126/science.290.5500.2319

D. L. Donoho and C. Grimes, Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data, Proceedings of the National Academy of Sciences, vol.100, issue.10, pp.5591-5596, 2003.
DOI : 10.1073/pnas.1031596100

S. T. Roweis, L. K. Saul, and G. Hinton, Global coordination of local linear models, Advances in Neural Information Processing Systems, pp.889-896, 2001.

S. Wold, A. Ruhe, H. Wold, and W. J. Dunn, The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses, SIAM Journal on Scientific and Statistical Computing, vol.5, issue.3, pp.735-743, 1984.
DOI : 10.1137/0905052

T. Kohonen, Self-organized formation of topologically correct feature maps, Biological Cybernetics, vol.13, issue.1, pp.59-69, 1982.
DOI : 10.1007/BF00337288

T. Kohonen, Self-Organizing Maps, 1997.

S. Wu and T. W. Chow, PRSOM: A New Visualization Method by Hybridizing Multidimensional Scaling and Self-Organizing Map, IEEE Transactions on Neural Networks, vol.16, issue.6, pp.1362-1380, 2005.
DOI : 10.1109/TNN.2005.853574

H. Yin, ViSOM: A novel method for multivariate data projection and structure visualization, IEEE Transactions on Neural Networks, vol.13, pp.237-243, 2002.

H. Yin, Data visualisation and manifold mapping using the ViSOM, Neural Networks, vol.15, issue.8-9, pp.1005-1016, 2002.
DOI : 10.1016/S0893-6080(02)00075-8

C. M. Bishop, M. Svensén, and C. K. Williams, GTM: The Generative Topographic Mapping, Neural Computation, vol.39, issue.1, 1998.
DOI : 10.1007/BF01889678

A. Kaban and M. Girolami, A combined latent class and trait model for the analysis and visualization of discrete data, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, issue.8, pp.859-872, 2001.
DOI : 10.1109/34.946989

N. D. Lawrence, Probabilistic non-linear principal component analysis with Gaussian process latent variable models, Journal of Machine Learning Research, vol.6, pp.1783-1816, 2005.

J. Hérault, A. Oliva, and A. Guérin-dugué, Scene categorization by Curvilinear Component Analysis of low frequency spectra, Proceedings of the European Symposium on Artificial Neural Network (ESANN), Bruges (Belgium), pp.91-96, 1997.

J. A. Lee and M. Verleysen, Nonlinear Projection with the Isotop Method, Proceedings of the International Conference on Artificial Neural Networks (ICANN), pp.933-938, 2002.
DOI : 10.1007/3-540-46084-5_151

J. A. Lee, C. Archambeau, and M. Verleysen, Locally Linear Embedding versus Isotop, Proceedings of the European Symposium on Artificial Neural Networks (ESANN), Bruges (Belgium), pp.527-534, 2003.

N. D. Lawrence and J. Quiñonero-candela, Local distance preservation in the GP-LVM through back constraints, Proceedings of the 23rd international conference on Machine learning , ICML '06
DOI : 10.1145/1143844.1143909

D. Francois, V. Wertz, and M. Verleysen, About the locality of kernels in high-dimensional spaces, Proceedings of the International Symposium on Applied Stochastic Models and Data Analysis (ASMDA), pp.238-245, 2005.

G. Hinton and S. T. Roweis, Stochastic Neighbor Embedding, Proceedings of the Neural Information Processing Systems (NIPS), pp.833-840, 2002.

F. Glover and M. Laguna, Tabu search, 1995.
URL : https://hal.archives-ouvertes.fr/hal-01389283

D. E. Goldberg, Genetic algorithms in search, Machine Learning, 1989.

C. R. Reeves, Genetic algorithms, 1995.

K. A. Dowsland, Simulated Annealing, 1995.
DOI : 10.1007/978-3-540-92910-9_49

P. Eades, A Heuristic for Graph Drawing, Congressus Numerantium: Proceedings of the 13th Manitoba Conference on Numerical Mathematics and Computing

T. Fruchterman and E. Reingold, Graph Drawing by Forcedirected placement, Software-Practice and Experience, pp.1129-1164, 1991.

G. D. Di-battista, P. Eades, R. Tamassia, and I. G. Tollis, Graph Drawing: Algorithms for the Visualization of Graphs, 1999.

M. Chalmers, A linear iteration time layout algorithm for visualizing high-dimensional data, Proceedings of the 7th Conference on Visualization, pp.127-132, 1996.

A. Morrison, G. Ross, and M. Chalmers, Fast Multidimensional Scaling Through Sampling, Springs and Interpolation, Information Visualization, vol.14, issue.1, pp.68-77, 2003.
DOI : 10.1057/palgrave.ivs.9500040

J. X. Li, Visualization of high-dimensional data with relational perspective map, Information Visualization, vol.3, issue.1, pp.49-59, 2004.
DOI : 10.1057/palgrave.ivs.9500051

A. Frick, A. Ludwig, and H. Mehldau, A Fast Adaptive Layout Algorithm for Undirected Graphs, Proceedings of International Workshop on Graph Drawing, 1994.

N. Quinn and M. Breuer, A forced directed component placement procedure for printed circuit boards, IEEE Transactions on Circuits and Systems, vol.26, issue.6, pp.377-388, 1979.
DOI : 10.1109/TCS.1979.1084652

S. V. Edwards, B. Fertil, A. Giron, and P. J. Deschavanne, A Genomic Schism in Birds Revealed by Phylogenetic Analysis of DNA Strings, Systematic Biology, vol.51, issue.4, pp.599-613, 2002.
DOI : 10.1080/10635150290102285

C. Chapus, C. Dufraigne, S. Edwards, A. Giron, B. Fertil et al., Exploration of phylogenetic data using a global sequence analysis method, BMC Evolutionary Biology, vol.5, issue.1, p.63, 2005.
DOI : 10.1186/1471-2148-5-63

URL : https://hal.archives-ouvertes.fr/inserm-00091302

Y. L. Yap, X. W. Zhang, and A. Danchin, Relationship of SARS- CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling, BMC Bioinformatics, p.43, 2003.

V. Michel and . Born, He received the M.S. and Ph.D. degrees in Electrical Engineering from the Université catholique de Louvain (Belgium) in 1987 and 1992, respectively. He was an Invited Professor at the SwissEcole Polytechnique Fédérale de Lausanne, Switzerland) in 1992, at the Université d'Evry Val d'Essonne (France) in 2001, and at the Université Paris I Panthéon, He is now a Research Director of the Belgian F.N.R.S, 1965.