Adam Godzik, PhD, received his doctorate in physics in 1990 from the University of Warsaw in Poland. His primary interest is in analyzing biological data in new ways to uncover hidden trends and regularities. His group has developed algorithms to find hidden patterns in protein sequences (FFAS), identify flexibilities in protein structures (FATCAT) and cluster very large sets of protein or nucleic acid sequences (CD-HIT). Along the way, he built expertise in managing large amounts of data by developing tailored databases, such as a custom-made database that collects data from a high-throughput protein structure determination pipeline at the Joint Center for Structural Genomics (JCSG) and is also used to automatically deposit data in public databases. His group has also developed several boutique databases to store information on protein structural alignments (FSN), proteolytic events (CutDB) and protein annotations (TOPSAN), the last of which experiments with format-free data collection.