Peptide Sequencing Teaching Tool

Overview

This project is an extension of the computational biology research I contributed to under Professor Matthew Tien. One of the first projects I worked on was a novel approach to peptide sequencing. Proteomics (the large-scale study of peptides and proteins) is an increasingly powerful and indispensable technology in molecular cell biology (Pandey and Mann, 2000). The foundational technology in mass-spectrometry-based proteomics is peptide sequencing. However, it is often not well understood (Steen and Mann, 2004). Professor Tien was teaching his Intro to Comp Bio (BIOL-250) class about mass spectrometry, the algorithms used, and how to interpret the data. However, he predicted his students would struggle to understand peptide sequencing and mass spectrometry data. This led Professor Tien and me to create the problem statement: how might we develop a tool that helps students in his Intro to Comp Bio (BIOL-250) class understand and interpret peptide sequencing, mass spectrometry, and its data?

Solving this problem statement involved three phases: (i) working with Professor Tien and adhering to HCD principles to develop the prototype teaching tool; (ii) presenting the tool in his class, demonstrating it, and helping students use it; and (iii) using feedback and observations from Professor Tien and his students to iteratively redesign the teaching tool’s design for future use. In this design brief, I overview the design process I used throughout these steps and how I applied HCD and Design Thinking principles in this project.

Final Design

The final design is an online graphical user interface (GUI) that visually represents both a random peptide and the simulated mass spectrometry output (Figure A).

Figure A

The random peptide is represented by a vertical array of amino acids and their masses (left). These can be dragged and dropped to rearrange their order. Users are given two spectrum outputs (right), which are generated based on the “Current” and “Goal” peptides. By rearranging the order of the amino acids, the “Current Masses” spectrum dynamically responds, helping students understand the relationship between the peptide and mass spectrometry data. The “Goal Masses” are generated based on a hidden “Goal” peptide. When a simulated “current” mass aligns with a simulated “goal” mass, it turns from red to green. When the peptides match, all the masses in the current spectrum turn green.

Figure B

Figure C

Figure D

Figure E

Users can also hover over the bars, which show the summed amino acid masses of that fragment (Figure B). Clicking on a bar highlights the amino acid fragments that sum to the clicked frequency. This can be done for current or goal mass frequencies (Figure C/D). Clicking goal frequencies will highlight the amino acid fragment that needs to be in order to reconstruct the peptide (Figure C), which the user can use to sequence the peptide (Figure D). The graphical representations on the right represent the relative abundance as a function of the mass-to-charge ratio generated from simulating mass spectrometry on the peptide.

References

Pandey, A., & Mann, M. (2000). Proteomics to study genes and genomes. Nature, 405(6788), 837–846. https://doi.org/10.1038/35015709

Steen, H., & Mann, M. (2004). The abc’s (and xyz’s) of peptide sequencing. Nature Reviews Molecular Cell Biology, 5(9), 699–711. https://doi.org/10.1038/nrm1468