BIO-AJAX

An extensible framework for biological data cleaning

Katherine Herbert, Narain H. Gehani, William H. Piel, Jason T L Wang, Cathy H. Wu

Research output: Contribution to journalArticleResearchpeer-review

14 Citations (Scopus)

Abstract

As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.

Original languageEnglish
Pages (from-to)51-57
Number of pages7
JournalSIGMOD Record
Volume33
Issue number2
StatePublished - 1 Jun 2004

Fingerprint

Cleaning
Information systems

Cite this

Herbert, K., Gehani, N. H., Piel, W. H., Wang, J. T. L., & Wu, C. H. (2004). BIO-AJAX: An extensible framework for biological data cleaning. SIGMOD Record, 33(2), 51-57.
Herbert, Katherine ; Gehani, Narain H. ; Piel, William H. ; Wang, Jason T L ; Wu, Cathy H. / BIO-AJAX : An extensible framework for biological data cleaning. In: SIGMOD Record. 2004 ; Vol. 33, No. 2. pp. 51-57.
@article{d776b4f2f2e740f28757e181d05d0d13,
title = "BIO-AJAX: An extensible framework for biological data cleaning",
abstract = "As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.",
author = "Katherine Herbert and Gehani, {Narain H.} and Piel, {William H.} and Wang, {Jason T L} and Wu, {Cathy H.}",
year = "2004",
month = "6",
day = "1",
language = "English",
volume = "33",
pages = "51--57",
journal = "SIGMOD Record",
issn = "0163-5808",
publisher = "Association for Computing Machinery (ACM)",
number = "2",

}

Herbert, K, Gehani, NH, Piel, WH, Wang, JTL & Wu, CH 2004, 'BIO-AJAX: An extensible framework for biological data cleaning', SIGMOD Record, vol. 33, no. 2, pp. 51-57.

BIO-AJAX : An extensible framework for biological data cleaning. / Herbert, Katherine; Gehani, Narain H.; Piel, William H.; Wang, Jason T L; Wu, Cathy H.

In: SIGMOD Record, Vol. 33, No. 2, 01.06.2004, p. 51-57.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - BIO-AJAX

T2 - An extensible framework for biological data cleaning

AU - Herbert, Katherine

AU - Gehani, Narain H.

AU - Piel, William H.

AU - Wang, Jason T L

AU - Wu, Cathy H.

PY - 2004/6/1

Y1 - 2004/6/1

N2 - As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.

AB - As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.

UR - http://www.scopus.com/inward/record.url?scp=4444286281&partnerID=8YFLogxK

M3 - Article

VL - 33

SP - 51

EP - 57

JO - SIGMOD Record

JF - SIGMOD Record

SN - 0163-5808

IS - 2

ER -

Herbert K, Gehani NH, Piel WH, Wang JTL, Wu CH. BIO-AJAX: An extensible framework for biological data cleaning. SIGMOD Record. 2004 Jun 1;33(2):51-57.