Learning Clause Representation from Dependency-Anchor Graph for Connective Prediction

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Semantic representation that supports the choice of an appropriate connective between pairs of clauses inherently addresses discourse coherence, which is important for tasks such as narrative understanding, argumentation, and discourse parsing. We propose a novel clause embedding method that applies graph learning to a data structure we refer to as a dependencyanchor graph. The dependency anchor graph incorporates two kinds of syntactic information, constituency structure and dependency relations, to highlight the subject and verb phrase relation. This enhances coherencerelated aspects of representation. We design a neural model to learn a semantic representation for clauses from graph convolution over latent representations of the subject and verb phrase. We evaluate our method on two new datasets: a subset of a large corpus where the source texts are published novels, and a new dataset collected from students' essays. The results demonstrate a significant improvement over tree-based models, confirming the importance of emphasizing the subject and verb phrase. The performance gap between the two datasets illustrates the challenges of analyzing student's written text, plus a potential evaluation task for coherence modeling and an application for suggesting revisions to students.

Original languageEnglish (US)
Title of host publicationTextGraphs 2021 - Graph-Based Methods for Natural Language Processing, Proceedings of the 15th Workshop - in conjunction with the 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2021
EditorsAlexander Panchenko, Fragkiskos D. Malliaros, Varvara Logacheva, Abhik Jana, Dmitry Ustalov, Peter Jansen
PublisherAssociation for Computational Linguistics (ACL)
Pages54-66
Number of pages13
ISBN (Electronic)9781954085381
StatePublished - 2021
Event15th Workshop on Graph-Based Methods for Natural Language Processing, TextGraphs 2021 - Mexico City, Mexico
Duration: Jun 11 2021 → …

Publication series

NameTextGraphs 2021 - Graph-Based Methods for Natural Language Processing, Proceedings of the 15th Workshop - in conjunction with the 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2021

Conference

Conference15th Workshop on Graph-Based Methods for Natural Language Processing, TextGraphs 2021
Country/TerritoryMexico
CityMexico City
Period6/11/21 → …

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Software
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Learning Clause Representation from Dependency-Anchor Graph for Connective Prediction'. Together they form a unique fingerprint.

Cite this