Analytical Platforms and Databases 233
MassBank
http://www.massbank.jp/index.html?lang=en
MassBank (Horai et al., 2008) covers high-resolution spectra from a wide range of hybrid
MS, ESI-MS and tandem MS analyses, and includes comprehensive data on spectra
measured under various experimental conditions (ionization methods, collision methods,
etc.). Presently, MassBank contains over 10,000 mass spectra and 1000 compounds. Many
compounds were measured by multiple MS methodologies such as quadruple time-of-flight
MS (QqTOF-MS), GC/TOF-MS, LC/TOF-MS, LC/triple quadruple tandem-MS/MS
(LC/QqQ-MS/MS) and capillary electrophoresis/TOF-MS (CE/TOF-MS). Moreover,
MassBank database can be queried by spectral similarity which is based on a vector space
search algorithm and is expanded to obtain real value (m/z) data.
Madison Metabolomics Consortium Database
http://mmcd.nmrfam.wisc.edu/
Madison Metabolomics Consortium Database (MMCD) (Cui et al., 2008b) contains data
based on NMR spectroscopy and MS measurement. Presently, it serves as a hub for
information obtained on small molecules of biological interest from databases and
scientific literature. Information on each metabolite entry in the MMCD is available in an
average of 50 separate data fields. These data fields provide chemical formulae, names and
synonyms, structural data, physical and chemical properties, NMR and MS data of pure
compounds under defined conditions where available, NMR chemical shifts determined by
empirical and/or theoretical approaches, calculated isotopomer masses, and information on
the presence of the metabolite in different biological species. Further, extensive links to
images, references and other public databases such as KEGG (Kanehisa et al., 2002) and
PubChem (Wang et al., 2009) are provided. The MMCD search engine supports versatile
data mining and allows users to make individual or bulk queries on the basis of
experimental NMR and/or MS data as well as other criteria. The database mainly contains
Arabidopsis data; however, it can also be referred to other organisms. In MMCD, the entry
for each compound contains two-dimensional and three-dimensional representations of
covalent structure, calculated masses for several different monoisotopic compositions,
empirically predicted chemical shifts and, where available, experimental NMR chemical
shifts and LC-MS data collected under defined conditions.
MS/MS spectral tag libraries at PRIMe
http://prime.psc.riken.jp/lcms/ms2tview/ms2tview.html
The objective of ‘non-targeted’ metabolic profiling analysis is to understand the
mechanisms underlying metabolic events in plants by determining all detectable
metabolites. Of the various profiling techniques, non-targeted analysis using LC-MS is a
promising tool for investigating the diversity of phytochemicals (Villas-Boas et al., 2005;
Dettmer et al., 2007; Bottcher et al., 2008); this tool is known to be as effective as methods
employing GC-MS (Tikunov et al., 2005; Moco et al., 2007). MS/MS spectral tag (MS2T)
libraries (Matsuda et al., 2009) of metabolites in Arabidopsis are created from a set of
MS/MS spectra acquired using the automatic data acquisition function of the mass
spectrometer. A peak annotation procedure based on the MS2T library was developed for
informative non-targeted metabolic profiling analysis involving LC-MS. From this library,