Abstract
As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.
Original language | English |
---|---|
Pages (from-to) | 51-57 |
Number of pages | 7 |
Journal | SIGMOD Record |
Volume | 33 |
Issue number | 2 |
State | Published - 1 Jun 2004 |
Fingerprint
Cite this
}
BIO-AJAX : An extensible framework for biological data cleaning. / Herbert, Katherine; Gehani, Narain H.; Piel, William H.; Wang, Jason T L; Wu, Cathy H.
In: SIGMOD Record, Vol. 33, No. 2, 01.06.2004, p. 51-57.Research output: Contribution to journal › Article
TY - JOUR
T1 - BIO-AJAX
T2 - An extensible framework for biological data cleaning
AU - Herbert, Katherine
AU - Gehani, Narain H.
AU - Piel, William H.
AU - Wang, Jason T L
AU - Wu, Cathy H.
PY - 2004/6/1
Y1 - 2004/6/1
N2 - As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.
AB - As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.
UR - http://www.scopus.com/inward/record.url?scp=4444286281&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:4444286281
VL - 33
SP - 51
EP - 57
JO - SIGMOD Record
JF - SIGMOD Record
SN - 0163-5808
IS - 2
ER -