UB Receives $1 Million from NSF to Fund System to Store Huge Amounts of Computer Data

Essential to work in bioinformatics, geographic information science

Release Date: December 21, 2001 This content is archived.

Print

BUFFALO, N.Y. -- A $1 million National Science Foundation infrastructure award to store, manage and analyze complex scientific data is boosting pioneering research at the University at Buffalo in bioinformatics, geographic information science and other important research areas.

Nearly $600,000 in matching funds will be contributed to the project by UB and the Strategically Targeted Academic Research (STAR) Center for Disease Modeling and Therapy Discovery, funded by the New York State Office of Science, Technology and Academic Research.

The highly competitive grant provides UB with the computational infrastructure necessary to manage, analyze and visualize large-scale multidimensional data sets that lie at the heart of some of UB's most cutting-edge research in drug design, molecular-structure determination and the understanding of complex geographic images.

"The timing of our receiving this grant now from the National Science Foundation is key," said Jaylan S. Turkkan, Ph.D., UB vice president for research.

"In conjunction with our new Center of Excellence in Bioinformatics, it will ensure that UB is at the leading edge not only in high-capacity storage of large datasets that form the knowledge base for bioinformatics, geographic information science and other important research areas, but also in developing the complex software tools that will be needed for understanding and analyzing the data.

"The combination of these two events is a one-two punch in terms of leveraging UB for large-scale center grants from the National Institutes of Health and other federal agencies," Turkkan added.

The new system will provide the UB Department of Computer Science and Engineering with 20 times the amount of data storage it now has in a centrally managed resource that will assist affiliated researchers working in the UB departments of geography and pharmaceutical sciences and at Hauptman-Woodward Medical Research Institute and Roswell Park Cancer Institute.

In emerging disciplines, such as bioinformatics and geographic information science (GIS), the issue of data storage has shifted dramatically from one easily solved by filing cabinets and the hard drives of PCs to one that can only be solved by extremely specialized computer equipment.

"The bottom line is if you cannot store the data, you cannot do the research," stated Aidong Zhang, Ph.D., UB associate professor of computer science and engineering and principal investigator on the grant.

Research projects that are named on the grant include:

o Determination of the 3-dimensional structure of proteins

o Metadata and knowledge extraction, representation and management in geographic information

o Gene expression data analysis

o Data visualization

o Data management

By the end of the five-year grant, UB will have a total of 20 terabytes of computer storage space. A single terabyte is 1 million megabytes. Twenty terabytes is approximately the same amount of storage found in one thousand PCs, although researchers caution that the new system is far more than just a fantastically large storage space.

"It's not just this huge amount of storage. It's high-performance storage that enables large amounts of data to be moved around and accessed quickly and easily," said Robert Straubinger, Ph.D., associate professor of pharmaceutical sciences, who is a co-investigator on the grant.

Zhang pointed out that in addition, the new system will be extremely reliable, another prerequisite for storing such vast amounts of data, since losing so much data would be catastrophic for any research project.

"Data is written in multiple places in the system, so that a hardware failure won't result in data loss," she said.

The new infrastructure also will facilitate the kind of computational research required by these emerging disciplines.

"In pharmacogenomics, for example, there is a tremendous need for computational research focused on analysis of the kinds of data we generate, with the objective of better understanding the mechanism of drug action or how cells and tissues respond to drugs," said Straubinger.

Zhang, whose expertise is in database management and data mining, is working with other UB researchers to develop just those tools.

Having these machines on campus, she explained, will enable UB's computer scientists to begin to develop the data analysis and visualization systems that will make working with such large data sets more efficient.

Media Contact Information

Ellen Goldbaum
News Content Manager
Medicine
Tel: 716-645-4605
goldbaum@buffalo.edu