What is Next in Proteomics: Parsing 20 Years of Public Data by AI Maps Trends in Proteomics and Forecasts Technology

Trends of the last 20 years in biotechnology were revealed by applying Consilience’s artificial intelligence and natural language processing (NLP) to publicly available data. Using this “science-of-science” approach, we capture convergent trends in proteomics, in both technology development and application across the phylogenetic tree of life. Because major gaps remain in our knowledge of protein composition, structure, and location over time, we report trends in persistent, popular approaches and emerging technologies across 94 ideas drawn from a corpus of 29 journals in PubMed over two decades. New metrics for clusters of these ideas reveal the progression and popularity of emerging approaches, such as single-cell, spatial, compositional, and chemical proteomics, that are designed to better capture protein-level chemistry and biology. This analysis of the proteomics literature quantifies the Rate of Rise for a next generation of technologies that better define, quantify, and visualize the multiple dimensions of the proteome and that will transform our ability to measure and understand proteins in the coming decade.

Longitudinal view across 3-year epochs from 1999 to 2023. (a, b) Percentile rank of average Popularity (a) and Rate of Rise (b) of all the individual ideas within the key technology themes (Compositional Proteomics, Chemical Proteomics, Single-Cell Proteomics, Structural Proteomics, and Spatial Proteomics). (c, d) Percentile rank of Popularity (c) and Rate of Rise (d) for ideas in the Spatial Proteomics theme.
A corpus of 29 journals in PubMed was analyzed.
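
The text above does not define how Popularity or Rate of Rise is computed, so the following is only a minimal sketch under stated assumptions: Popularity is taken to be the number of articles mentioning an idea within a 3-year epoch, Rate of Rise is taken to be the change in that count from the previous epoch, and each metric is converted to a percentile rank among ideas in the same epoch. The function names and the `(year, idea)` input format are hypothetical and are not taken from the Consilience tooling.

```python
from collections import defaultdict


def epoch_of(year, start=1999, width=3):
    """Return the first year of the `width`-year epoch containing `year`."""
    return start + ((year - start) // width) * width


def popularity_by_epoch(mentions):
    """Count mentions per idea per epoch.

    `mentions` is an iterable of (year, idea) pairs, one pair per article
    that mentions the idea (an assumed input format).
    """
    counts = defaultdict(lambda: defaultdict(int))
    for year, idea in mentions:
        counts[epoch_of(year)][idea] += 1
    return {epoch: dict(ideas) for epoch, ideas in counts.items()}


def rate_of_rise(popularity, ideas):
    """Assumed definition: change in count relative to the previous epoch."""
    epochs = sorted(popularity)
    return {
        cur: {i: popularity[cur].get(i, 0) - popularity[prev].get(i, 0) for i in ideas}
        for prev, cur in zip(epochs, epochs[1:])
    }


def percentile_rank(values):
    """Map each idea to the percentage of ideas whose value is <= its own."""
    n = len(values)
    return {
        idea: 100.0 * sum(1 for other in values.values() if other <= v) / n
        for idea, v in values.items()
    }


# Toy data for two hypothetical ideas, "A" and "B".
mentions = [(1999, "A"), (2000, "A"), (2001, "B"),
            (2002, "A"), (2003, "B"), (2004, "B"), (2004, "B")]
popularity = popularity_by_epoch(mentions)
rise = rate_of_rise(popularity, ideas=["A", "B"])
print(percentile_rank(popularity[2002]))  # {'A': 50.0, 'B': 100.0}
print(percentile_rank(rise[2002]))        # {'A': 50.0, 'B': 100.0}
```

Ranking within each epoch, rather than across the whole period, keeps the values comparable over time even as the total volume of published articles grows; a theme-level curve like those in panels (a) and (b) would then average the per-idea values within each theme.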