CNeSp

The Chinese Negation and Speculation (abbr., CNeSp) corpus consists of scientific literature, product reviews, and financial articles annotated for negation, speculation and their linguistic scope. This was done to allow a comparison between the development of systems for negation/speculation identification. The corpus is publicly available for research purposes.

Please cite if you make use of the corpus:
Bowei Zou, Qiaoming Zhu, and Guodong Zhou. Negation and Speculation Identification in Chinese Language. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL'2015), pages 656-665.

[annotation guidelines]
Annotation principles are also discussed in the following paper:
[articles]

download

The corpus consists of texts taken from 3 different sources in order to ensure that it captures the heterogeneity of language.

Scientific Literature: 19 articles from Chinese Journal of Computers (Vol.35(11)).
Financial Article: 311 articles from "timely rain for stock market" column of Sina.com in April, 2013.
Product Review: 821 comments of hotel service from Ctrip.com.

Full Corpus v1.2