Commit 3c512f3

Update README.md
1 parent 50fc185 commit 3c512f3

1 file changed: +16 -1 lines changed

README.md

Lines changed: 16 additions & 1 deletion
@@ -76,7 +76,22 @@ If you want to contribute new feature/language support or found a text that is i
 5. Create a new Pull Request
 
 ## Citation
-TBD
+If you use `pysbd` package in your projects or research, please cite [PySBD: Pragmatic Sentence Boundary Disambiguation](https://www.aclweb.org/anthology/2020.nlposs-1.15).
+```
+@inproceedings{sadvilkar-neumann-2020-pysbd,
+title = "{P}y{SBD}: Pragmatic Sentence Boundary Disambiguation",
+author = "Sadvilkar, Nipun and
+Neumann, Mark",
+booktitle = "Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS)",
+month = nov,
+year = "2020",
+address = "Online",
+publisher = "Association for Computational Linguistics",
+url = "https://www.aclweb.org/anthology/2020.nlposs-1.15",
+pages = "110--114",
+abstract = "We present a rule-based sentence boundary disambiguation Python package that works out-of-the-box for 22 languages. We aim to provide a realistic segmenter which can provide logical sentences even when the format and domain of the input text is unknown. In our work, we adapt the Golden Rules Set (a language specific set of sentence boundary exemplars) originally implemented as a ruby gem pragmatic segmenter which we ported to Python with additional improvements and functionality. PySBD passes 97.92{\%} of the Golden Rule Set examplars for English, an improvement of 25{\%} over the next best open source Python tool.",
+}
+```
 
 ## Credit
 