Incorporating terminology evolution for query translation in text retrieval with association rules

Amal Kalurachchi, Aparna Varde, Srikanta Bedathur, Gerhard Weikum, Jing Peng, Anna Feldman

Research output: Chapter in Book/Report/Conference proceedingConference contribution

22 Citations (Scopus)

Abstract

Time-stamped documents such as newswire articles, blog posts and other web-pages are often archived online. When these archives cover long spans of time, the terminology within them could undergo significant changes. Hence when users pose queries pertaining to historical information over such documents, the queries need to be translated taking into account these temporal changes in order to provide accurate responses to users. For example, a query on Sri Lanka should automatically retrieve documents with its former name Ceylon. We call such concepts SITACs, i.e., Semantically Identical Temporally Altering Concepts. In order to discover SITACs, we propose an approach based on a novel framework constituting an integration of natural language processing, association rule mining and contextual similarity as a learning technique. The proposed approach has been experimented with real data and has been found to yield good results with respect to efficiency and accuracy.

Original languageEnglish
Title of host publicationCIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops
Pages1789-1792
Number of pages4
DOIs
StatePublished - 1 Dec 2010
Event19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10 - Toronto, ON, Canada
Duration: 26 Oct 201030 Oct 2010

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Other

Other19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10
CountryCanada
CityToronto, ON
Period26/10/1030/10/10

Fingerprint

Query
Text retrieval
Query translation
Association rules
Association rule mining
Sri Lanka
Blogs
World Wide Web
Natural language processing

Keywords

  • Association rules
  • Contextual similarity
  • Natural language processing
  • Ranking
  • Search
  • Web IR

Cite this

Kalurachchi, A., Varde, A., Bedathur, S., Weikum, G., Peng, J., & Feldman, A. (2010). Incorporating terminology evolution for query translation in text retrieval with association rules. In CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops (pp. 1789-1792). (International Conference on Information and Knowledge Management, Proceedings). https://doi.org/10.1145/1871437.1871730
Kalurachchi, Amal ; Varde, Aparna ; Bedathur, Srikanta ; Weikum, Gerhard ; Peng, Jing ; Feldman, Anna. / Incorporating terminology evolution for query translation in text retrieval with association rules. CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops. 2010. pp. 1789-1792 (International Conference on Information and Knowledge Management, Proceedings).
@inproceedings{31bbf3c92c594c5f98d81211c8139ea6,
title = "Incorporating terminology evolution for query translation in text retrieval with association rules",
abstract = "Time-stamped documents such as newswire articles, blog posts and other web-pages are often archived online. When these archives cover long spans of time, the terminology within them could undergo significant changes. Hence when users pose queries pertaining to historical information over such documents, the queries need to be translated taking into account these temporal changes in order to provide accurate responses to users. For example, a query on Sri Lanka should automatically retrieve documents with its former name Ceylon. We call such concepts SITACs, i.e., Semantically Identical Temporally Altering Concepts. In order to discover SITACs, we propose an approach based on a novel framework constituting an integration of natural language processing, association rule mining and contextual similarity as a learning technique. The proposed approach has been experimented with real data and has been found to yield good results with respect to efficiency and accuracy.",
keywords = "Association rules, Contextual similarity, Natural language processing, Ranking, Search, Web IR",
author = "Amal Kalurachchi and Aparna Varde and Srikanta Bedathur and Gerhard Weikum and Jing Peng and Anna Feldman",
year = "2010",
month = "12",
day = "1",
doi = "10.1145/1871437.1871730",
language = "English",
isbn = "9781450300995",
series = "International Conference on Information and Knowledge Management, Proceedings",
pages = "1789--1792",
booktitle = "CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops",

}

Kalurachchi, A, Varde, A, Bedathur, S, Weikum, G, Peng, J & Feldman, A 2010, Incorporating terminology evolution for query translation in text retrieval with association rules. in CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops. International Conference on Information and Knowledge Management, Proceedings, pp. 1789-1792, 19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10, Toronto, ON, Canada, 26/10/10. https://doi.org/10.1145/1871437.1871730

Incorporating terminology evolution for query translation in text retrieval with association rules. / Kalurachchi, Amal; Varde, Aparna; Bedathur, Srikanta; Weikum, Gerhard; Peng, Jing; Feldman, Anna.

CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops. 2010. p. 1789-1792 (International Conference on Information and Knowledge Management, Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Incorporating terminology evolution for query translation in text retrieval with association rules

AU - Kalurachchi, Amal

AU - Varde, Aparna

AU - Bedathur, Srikanta

AU - Weikum, Gerhard

AU - Peng, Jing

AU - Feldman, Anna

PY - 2010/12/1

Y1 - 2010/12/1

N2 - Time-stamped documents such as newswire articles, blog posts and other web-pages are often archived online. When these archives cover long spans of time, the terminology within them could undergo significant changes. Hence when users pose queries pertaining to historical information over such documents, the queries need to be translated taking into account these temporal changes in order to provide accurate responses to users. For example, a query on Sri Lanka should automatically retrieve documents with its former name Ceylon. We call such concepts SITACs, i.e., Semantically Identical Temporally Altering Concepts. In order to discover SITACs, we propose an approach based on a novel framework constituting an integration of natural language processing, association rule mining and contextual similarity as a learning technique. The proposed approach has been experimented with real data and has been found to yield good results with respect to efficiency and accuracy.

AB - Time-stamped documents such as newswire articles, blog posts and other web-pages are often archived online. When these archives cover long spans of time, the terminology within them could undergo significant changes. Hence when users pose queries pertaining to historical information over such documents, the queries need to be translated taking into account these temporal changes in order to provide accurate responses to users. For example, a query on Sri Lanka should automatically retrieve documents with its former name Ceylon. We call such concepts SITACs, i.e., Semantically Identical Temporally Altering Concepts. In order to discover SITACs, we propose an approach based on a novel framework constituting an integration of natural language processing, association rule mining and contextual similarity as a learning technique. The proposed approach has been experimented with real data and has been found to yield good results with respect to efficiency and accuracy.

KW - Association rules

KW - Contextual similarity

KW - Natural language processing

KW - Ranking

KW - Search

KW - Web IR

UR - http://www.scopus.com/inward/record.url?scp=78651306430&partnerID=8YFLogxK

U2 - 10.1145/1871437.1871730

DO - 10.1145/1871437.1871730

M3 - Conference contribution

AN - SCOPUS:78651306430

SN - 9781450300995

T3 - International Conference on Information and Knowledge Management, Proceedings

SP - 1789

EP - 1792

BT - CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops

ER -

Kalurachchi A, Varde A, Bedathur S, Weikum G, Peng J, Feldman A. Incorporating terminology evolution for query translation in text retrieval with association rules. In CIKM'10 - Proceedings of the 19th International Conference on Information and Knowledge Management and Co-located Workshops. 2010. p. 1789-1792. (International Conference on Information and Knowledge Management, Proceedings). https://doi.org/10.1145/1871437.1871730