Abstract
This paper describes work that attempts to use language technology as a solution to the problem of information overload. The specific domain of application is the database of company announcements accessible via the web site of the Australian Stock Exchange: to meet regulatory requirements, over 100,000 documents a year are made available via this site, with only limited search facilities. We use a variety of techniques from language technology to make it easier to explore and manage the information in this data set. In this paper, we focus on our use of information extraction, which identifies and extracts important elements of information from a document, and text compaction, which applies linguistically-motived substitutions to reduce potential summary sentences to more compact forms. Together, these techniques provide a way of producing summaries of a significant proportion of the document base.
Original language | English |
---|---|
Title of host publication | Proceedings of 2005 IEEE |
Subtitle of host publication | International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05): Oct. 30 - Nov. 1, Wuhan, China |
Editors | Chengqing Zong, Maosong Sun |
Place of Publication | Piscataway, NJ |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 651-656 |
Number of pages | 6 |
ISBN (Print) | 0780393619, 9780780393615 |
DOIs | |
Publication status | Published - 2005 |
Event | 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05 - Wuhan, China Duration: 30 Oct 2005 → 1 Nov 2005 |
Other
Other | 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05 |
---|---|
Country/Territory | China |
City | Wuhan |
Period | 30/10/05 → 1/11/05 |