Collective content selection for concept-to-text generation

Authors:
Regina Barzilay, Massachusetts Institute of Technology
Mirella Lapata, University of Edinburgh

HLT '05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, October 2005, Pages 331–338
https://doi.org/10.3115/1220575.1220617

Published: 6 October 2005

ABSTRACT

A content selection component determines which information should be conveyed in the output of a natural language generation system. We present an efficient method for automatically learning content selection rules from a corpus and its related database. Our modeling framework treats content selection as a collective classification problem, thus allowing us to capture contextual dependencies between input items. Experiments in a sports domain demonstrate that this approach achieves a substantial improvement over context-agnostic methods.
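To make the contrast between collective and context-agnostic selection concrete, here is a minimal sketch. It is not the authors' implementation: the abstract does not specify the model, so the cost function, the link weights, and the item names (`touchdown_pass`, `touchdown_catch`, `short_run`) are all hypothetical. The sketch assumes each database item has an individual score for being selected and that linked items pay a penalty when they receive different labels. It finds the cheapest joint labeling by exhaustive search, which is only practical for a handful of items.

```python
from itertools import product


def collective_select(ind_scores, links):
    """Pick a joint 0/1 labeling of items by minimising a total cost.

    ind_scores: dict mapping item -> score in [0, 1] that the item,
                judged on its own, should be selected (hypothetical input).
    links:      dict mapping (item_a, item_b) -> non-negative penalty paid
                when the two items receive different labels (hypothetical).
    Returns (labels, cost) for the cheapest labeling found.
    """
    items = sorted(ind_scores)
    best_labels, best_cost = None, float("inf")
    # Exhaustive search over all 2^n labelings; fine for a toy example only.
    for assignment in product([0, 1], repeat=len(items)):
        labels = dict(zip(items, assignment))
        # Individual cost: how much each label disagrees with the item's own score.
        cost = sum(
            (1 - ind_scores[i]) if labels[i] else ind_scores[i] for i in items
        )
        # Contextual cost: penalty for every linked pair labeled differently.
        cost += sum(w for (a, b), w in links.items() if labels[a] != labels[b])
        if cost < best_cost:
            best_labels, best_cost = labels, cost
    return best_labels, best_cost


# Hypothetical sports-domain items and scores, for illustration only.
scores = {"touchdown_pass": 0.9, "touchdown_catch": 0.4, "short_run": 0.1}
links = {("touchdown_pass", "touchdown_catch"): 0.5}

independent, _ = collective_select(scores, {})   # context-agnostic baseline
collective, cost = collective_select(scores, links)
print(independent)
print(collective)
```

With no links, each item is decided on its own score, so `touchdown_catch` (0.4) is left out. With the link in place, leaving it out while keeping `touchdown_pass` costs more than including both, so the joint labeling selects it. In this toy setup, that is the kind of contextual dependency a collective formulation can capture and an item-by-item classifier cannot.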



Published in
HLT '05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
October 2005, 1054 pages
Conference Chair: Raymond J. Mooney, The University of Texas at Austin
Publisher: Association for Computational Linguistics, United States

Acceptance Rates
HLT '05 paper acceptance rate: 127 of 402 submissions (32%). Overall acceptance rate: 240 of 768 submissions (31%).