Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Extraction and Prediction of Biomedical Database Identifier Using Neural Networks towards Data Network Construction

Hendrik Mehlhorn, Matthias Lange, Uwe Scholz, Falk Schreiber

Source Title: Cases on Open-Linked Data and Semantic Web Applications

DOI: 10.4018/978-1-4666-2827-4.ch004

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In this work, we investigate to what extent an automated construction of an integrated data network is possible. We propose a method that predicts and extracts cross-references from multiple life science databases and possible referenced data targets. We study the retrieval quality of our method and report on first, promising results.

Chapter Preview

Top

1. Introduction

Bioinformatics is the field of science in which biology, computer science, and in particular information retrieval merge to form a single discipline. The ultimate goal of the field is to enable the discovery of new biological insights. The first step in this direction is already done. High throughput biotechnologies, like next generation sequencing, proteomics and metabolomics techniques produce a massive amount of data (Galperin & Fernandez-Suarez, 2012). But the data gathered in biology or medicine is as manifold as the biological research areas itself. If we will narrow down in this chapter the complex areas of biomedical research to molecular biology, bioinformatics attempts to model and interprets this data pathway: genome, gene sequence, protein sequence, protein structure, protein function, cellular pathways & networks, and biomedical literature. The first consequence of this revolution is the explosion of available data that biomolecular researchers have to harness and exploit (Roos, 2001) (e.g., as of March 2012, Genbank provides access to 150,000,000 DNA sequences¹ and in PubMed there are 2,400,000 research articles listed). The number of public available databases passed currently the number of high water mark of 1,200 (Galperin & Fernandez-Suarez, 2012).

The big players in this context are on the one hand companies like pharmaceutical or plant breeders on the other hand public or private financed research institute. Their role is either a data consumer or a data producer. In consequence there is a raising need for find, extract, merge, and synthesize information from multiple, disparate sources. Convergence of biology, computer science, and information technology will accelerate this multidisciplinary endeavor. The basic needs are formulated in Lacroix & Critchlow, 2003:

1.
On demand access and retrieval of the most up-to-date biological data and the ability to perform complex queries across multiple heterogeneous databases to find the most relevant information.
2.
Access to the best-of-breaded analytical tools and algorithms for extraction of useful information from the massive volume and diversity of biological data.
3.
A robust information integration infrastructure that connects various computational steps involving database queries, computational algorithms, and application software.

In consequence, database integration plays an important role in this context. Thus, we will subsequently briefly introduce the most popular concepts for database integration in life science. Using the World Wide Web or social networks as inspiring example, the basic idea presented in this chapter is to compute a network of biomedical knowledge by taking a set of database entries as input, analyzing the entries and their attributes and identifying potential cross-references in the same and in other databases. We propose IDPredictor, an algorithm that predicts cross-references from multiple life science databases and thus sets the basis for an enhanced information retrieval over biomedical data. We discuss to what extend IDPredictor can be used as method for an efficient and precise prediction of database cross-references.

In Section 1 we give a brief introduction to data management in life sciences. In particular approaches for data integration, information retrieval and aspects of data identifier are discussed. In Section 2 we present the underlying machine learning methods of IDPredictor. In Section 3 we discuss training methods and prediction performance measures. In Section 4 we discuss the prediction performance, preliminary results and the application to database networks.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Extraction and Prediction of Biomedical Database Identifier Using Neural Networks towards Data Network Construction

Abstract

1. Introduction

Complete Chapter List