Summary
Our goals are to understand the structural properties of biological molecules and to relate these structures to their biological functions. Toward these goals, we study the structures of protein-nucleic acid complexes in order help to explain the principles that underlie interactions between nucleic acids and proteins, as well as to increase our understanding of the role of water in mediating intermolecular interactions.
We have also established a program in structural bioinformatics. A key focus of this work has been to establish methods to collect, archive, and analyze structural data. In 1991, we created the Nucleic Acid Database (NDB). In October 1998, the management of the Protein Data Bank (PDB) was moved to the Research Collaboratory for Structural Bioinformatics, of which Rutgers is the lead site.
Bioinformatics: The PDB and the NDB
We are responsible for the continued development and operation of two archival databases: the Protein Data Bank (PDB;
http://www.pdb.org) and the Nucleic Acid Database (NDB;
http://ndbserver.rutgers.edu)
The PDB is managed by two members of the Research Collaboratory for Structural Bioinformatics (RCSB): Rutgers University and the San Diego Supercomputer Center and the Skaggs School of Pharmacy and Pharmaceutical Sciences at the University of California, San Diego. It is a member of the wwPDB (
www.wwpdb.org).
The PDB is the single worldwide depository of information about the three-dimensional structures of large biological molecules, including proteins and nucleic acids. These are the molecules of life that are found in all organisms including bacteria, yeast, plants, flies, and mice, and in healthy as well as diseased humans. Understanding the shape of a molecule helps to understand how it works.
The Nucleic Acid Database assembles and distributes structural information about nucleic acid structures. The core of the NDB is the relational database of the structural data of three-dimensional nucleic acid-containing crystal structures.
Binary and Ternary Complexes with Catabolite Activating Protein (CAP)

Although CAP makes no direct contact with the consensus base pair T:A at position 6 of DNA half site 5’-A
1A
2A
3T
4G
5T
6G
7A
8T
9C
10T
11-3’, it exhibits strong specificity for T:A at position 6. We have determined and compared crystallographic structures of CAP in complex with DNA sites having T:A at position 6 or the non-consensus base pair C:G at position 6, analyzing two DNA fragment lengths and two crystal forms. The results indicate that indirect readout in this system is mediated through altered DNA deformability.
Substitution of the carboxylate side chain of Glu181 by the one-methylene-group-shorter carboxylate side chain of Asp results in a new DNA binding specificity at position 6 of the DNA half site. Thus, whereas wild-type CAP prefers T:A at position 6, [181Asp]CAP prefers C:G at position 6. This specificity is observed
in vitro and
in vivo and is large in magnitude. To define the basis for this unexpected change in specificity, we have carried out structural studies of [181Asp]CAP in complex with DNA. The results indicate that the Glu Asp substitution eliminates the primary kink and thus eliminates indirect-readout-based specificity for T:A at position 6.
CAP activates transcription at P
lac and other promoters through interactions with the RNA polymerase alpha-CTD subunit. We have determined the structure of CAP in complex with DNA and alpha-CTD at 3.3Å. This structure provides the first high-resolution view of a transcriptional activator in complex with a functional target within the general transcription machinery.
DNA binding proteinsIn collaboration with Drs. Jannette Carey (Princeton University) and Cathy Lawson, we have determined the structure of E. coli trp repressor (trpR) in crystals grown from 30% aqueous isopropanol. Our current structural view of trpR as a compact alpha-helical homodimer is based largely on experiments performed in high salt. In the new structure, individual trpR subunits have a dramatically extended conformation and undergo domain swapping. Similar extended subunit structures may contribute to trpR aggregation phenomena.
Quantitative Rules for Discerning DNA-Binding Proteins Relative to the Site of DNA Interaction
We are working to develop simple and quantitative rules that can be used to identify proteins that are likely to bind to DNA. The method used here creates a statistical model of a protein fold relative to the site of DNA interaction. The statistical model is based on measurements that treat the secondary structure elements near the site of interaction as individual units. Analysis of the measurements for the Helix-Turn-Helix (HTH) DNA-binding motif show that they can be used to discern proteins with the motif with high accuracy. Further analysis will extend to other DNA-binding motifs and whole protein domains.
RNA VisualizationWe have also begun to develop new visualization tools to display RNA structures. RNADraw automatically displays the tertiary interactions on a 2D format. The 2D picture provides detailed contact information between the base pairs and provides a fast and simple way to compare related structures. We are planning to use this tool to help classify the different folds in RNA.
Director, Protein Data Bank
2006 M.J. Buerger Award
Representative Publications
The Worldwide Protein Data Bank (wwPDB): Ensuring a single, uniform archive of PDB data, Helen M. Berman, Kim Henrick, Haruki Nakamura, and John L. Markley,
Nucleic Acids Res., 2007 35 (Database issue): D301-3.
Outcome of a Workshop on Archiving Structural Models of Biological Macromolecules, Helen M. Berman, Stephen K. Burley, Wah Chiu, Andrej Sali, Alexei Adzhubei, Philip E. Bourne, Stephen H. Bryant, Roland L. Dunbrack, Jr., Krzysztof Fidelis, Joachim Frank, Adam Godzik, Kim Henrick, Andrzej Joachimiak, Bernard Heymann, David Jones, John L. Markley, John Moult, Gaetano T. Montelione, Christine Orengo, Michael G. Rossmann, Burkhard Rost, Helen Saibil, Torsten Schwede, Daron M. Standley, John D. Westbrook,
Structure, 2006 14/8:1211-1217.
Reflections on the Science and Law of Structural Biology, Genomics, & Drug Development, Helen M. Berman, Rochelle C. Dreyfuss,
UCLA Law Review, 2006 53:871-908.
The RCSB PDB Information Portal for Structural Genomics, Andrei Kouranov, Lei Xie, Joanna de la Crux, Li Chen, John Westbrook, Philip Bourne and Helen Berman,
Nucleic Acids Research, 2006 Vol 34, Database issue:D302-D305.
RNA conformational classes, Bohdan Schneider, Zdeněk Morávek, and Helen M. Berman,
Nucleic Acid Research, 32(5):1666-1677, 2004.
Benoff, B, Yang, H, Lawson, C, Parkinson, G, Liu, J, Blatter, E, Ebright, YW, Berman, HM, Ebright, RH (2002): Structural basis of transcription activation: The CAP-alphaCTD-DNA complex. Science 297:1562-1566.
Berman, HM, Goodsell, DS, Bourne, PE (2002): Protein structures: from famine to feast. American Sci. 90:350-359.
McLaughlin, WA and Berman, HM (2003): Statistical models for discerning protein structures containing the DNA-binding helix-turn-helix motif. J. Mol. Biol. 330:43-55.
Neidle, S, Schneider, B, Berman, HM (2003). Fundamentals of DNA and RNA structure. Structural Bioinformatics. P. E. Bourne and H. Weissig.Hoboken, NJ, John Wiley & Sons, Inc.: 41-73.
Bella, J, Eaton, M, Brodsky, B, Berman, HM (1994): Crystal and molecular structure of a collagen-like peptide at 1.9 Å resolution. Science 266:75-81.
Berman, HM, Bhat, TN, Bourne, PE, Feng, Z, Gilliland, G, Weissig, H, Westbrook, J (2000): The Protein Data Bank and the challenge of structural genomics. Nat. Struct. Biol. 7:957-959.
Berman, HM, Olson, WK, Beveridge, DL, Westbrook, J, Gelbin, A, Demeny, T, Hsieh, SH, Srinivasan, AR, Schneider, B (1992): The Nucleic Acid Database - a comprehensive relational database of three-dimensional structures of nucleic acids. Biophys. J. 63:751-759.
Berman, HM, Westbrook, J, Feng, Z, Gilliland, G, Bhat, TN, Weissig, H, Shindyalov, IN, Bourne, PE (2000): The Protein Data Bank. Nucleic Acids Res. 28:235-242.
Bourne, PE, Berman, HM, Watenpaugh, K, Westbrook, JD, Fitzgerald, PMD (1997): The macromolecular Crystallographic Information File (mmCIF). Meth. Enzymol. 277:571-590.
Jones, S, Daley, DTA, Luscombe, NM, Berman, HM, Thornton, JM (2001): Protein-RNA interactions: A structural analysis. Nucleic Acids Res. 29:934-954.
Jones, S, van Heyningen, P, Berman, HM, Thornton, JM (1999): Protein-DNA interactions: A structural analysis. J. Mol. Biol. 287:877-896. Kramer, RZ, Vitagliano, L, Bella, J, Berisio, R, Mazzarella, L, Brodsky, B, Zagari, A, Berman, HM (1998): X-ray crystallographic determination of a collagen-like peptide with the repeating sequence (Pro-Pro-Gly). J. Mol. Biol. 280:623-638.
Olson, WK, Bansal, M, Burley, SK, Dickerson, RE, Gerstein, M, Harvey, SC, Heinemann, U, Lu, X-J, Neidle, S, Shakked, Z, Sklenar, H, Suzuki, M, Tung, C-S, Westhof, E, Cynthia Wolberger, Berman, HM (2001): A standard reference frame for the description of nucleic acid base-pair geometry. J. Mol. Biol. 313:229-237.
Parkinson, G, Vojtechovsky, J, Clowney, L, Brünger, AT, Berman, HM (1996): New parameters for the refinement of nucleic acid containing structures. Acta Crystallogr. D52:57-64.
Parkinson, G, Wilson, C, Gunasekera, A, Ebright, YW, Ebright, RE, Berman, HM (1996): Structure of the CAP-DNA complex at 2.5 Å resolution: a complete picture of the protein-DNA interface. J. Mol. Biol. 260:395-408.
Schneider, B and Berman, HM (1995): Hydration of the DNA bases is local. Biophys. J. 69:2661-2669.