DNA-Based Indexing

Max H. Garzon, Kiran C. Bobba, Andrew Neel, Vinhthuy Phan

Source Title: International Journal of Nanotechnology and Molecular Computation (IJNMC) 2(3)

DOI: 10.4018/jnmc.2010070102

Abstract

DNA has been acknowledged as a suitable medium for massively parallel computing and as a “smart” glue for self-assembly. In this paper, a third capability of DNA is described in detail: as a memory capable of encoding and processing large amounts of data so that information can be retrieved associatively based on content. The technique is based on a novel representation of data on DNA that can shed light on the way DNA, RNA, and other biomolecules encode information, which may be important in applications to fields like bioinformatics, genetics, and natural language processing. Analyses are also provided of the sensitivity, robustness, and bounds on the theoretical capacity of the memories. Finally, the potential use of the memories is illustrated with two applications, one in genomic analysis for identification and classification, the other in information retrieval from text data in abiotic form.

Introduction

Techniques for large-scale, compact data representation and mining have recently been introduced. Examples are artificial neural networks (Haykin, 1988), Kanerva's associative memories (Kanerva, 1988), and Latent Semantic Indexing (LSI) (Deerwester et al., 1990). These memories have made it possible to represent and process large data corpora with powerful methods that provide information useful to humans in semantic terms, unlike the conventional syntactic methods used in electronic databases and data warehouses. The new methods share a set of common features. They are trained on representative sample data sets to build the memory by extracting deep patterns from the sample data, and they are then used on unknown data to perform similar functions with levels of success comparable to those on the known data. Thus we have training algorithms such as back-propagation with neural nets, learning patterns with Kanerva's memories, and dimension reduction and principal component analysis with LSI.
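The train-then-recall pattern shared by these memories can be illustrated with a minimal sketch. It is a toy content-addressable store that returns the stored pattern closest to a probe in Hamming distance. It is not Kanerva's sparse distributed memory, nor the DNA-based representation proposed in this paper; the class name `AssociativeMemory`, the function `hamming`, and the sample patterns and labels are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def hamming(a: str, b: str) -> int:
    """Count the positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("patterns must have equal length")
    return sum(x != y for x, y in zip(a, b))


class AssociativeMemory:
    """Toy content-addressable store (illustrative assumption, not the paper's method)."""

    def __init__(self) -> None:
        self._items: List[Tuple[str, str]] = []

    def train(self, pattern: str, label: str) -> None:
        """Store a sample pattern together with the label it should recall."""
        self._items.append((pattern, label))

    def recall(self, probe: str) -> Tuple[str, int]:
        """Return the label of the closest stored pattern and its distance to the probe."""
        if not self._items:
            raise LookupError("memory is empty")
        pattern, label = min(self._items, key=lambda item: hamming(item[0], probe))
        return label, hamming(pattern, probe)


# Hypothetical sample data: two stored patterns and one noisy probe.
memory = AssociativeMemory()
memory.train("ACGTACGT", "sample A")
memory.train("TTGGCCAA", "sample B")
label, distance = memory.recall("ACGTACGA")
print(label, distance)
```

The probe differs from the first stored pattern in a single position, so recall succeeds on data that was never stored verbatim, which is the sense in which retrieval is based on content rather than on an exact key.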

On the other hand, for over a decade now, Adleman's idea of using DNA for computational purposes has proven fertile ground for both computation (Adleman, 1994) and nano-assembly (Seeman, 1999; Winfree et al., 1998). However, finding a systematic procedure to map both symbolic (abiotic) and nonsymbolic (e.g., biological) information onto biomolecules for massively parallel processing in wet test tubes has faced several challenges. Mapping non-biological information for processing in vitro is an enormous challenge. Even the easier direct readout problem, i.e., converting genomic data into electronic form for conventional analysis, is an expensive and time-consuming process in bioinformatics (Mount, 2001). Moreover, the results of these analyses are usually available only in a manual form that cannot be applied directly as feedback to the carriers of genomic information. In this paper, we propose and discuss an approach that addresses both of these problems.

Three properties are critical for the eventual success of such a mapping algorithm/protocol, as discussed in (Blain & Garzon, 2004). First, the representation or indexing has to be universal, scalable, automatic, and fast. Universal means that any kind of symbolic data/pattern can be mapped, in principle, to DNA. Otherwise the mapping will restrict the kind of information mapped, and the processing capabilities in DNA form may be too constrained. Scalable means that the mapping can only be justified for massive quantities of data that cannot be processed by conventional means in reasonable times. It must therefore scale to the terabytes and higher orders of data that it will eventually encounter. Currently, no cost-effective techniques exist for transferring these volumes by manual addition and extraction of patterns one by one. Ordinary symbol-wise transductions require manually manufacturing the corresponding DNA strands, an impossible task with current technology. The indexing must also be automatic and high-speed because manual mapping, e.g., by synthesis of individual strands, is also very costly in time. An effective strategy must be automatable and eventually orders of magnitude faster than processing the data in silico.
