SEthesaurus manual-check dataset

No automatic algorithm can reach 100% accuracy, so we are manually checking the abbreviations and synonyms produced by our algorithm. This work is ongoing: so far we have checked more than 3,000 groups, which are listed in the file below.

manually checked abbreviations & synonyms

If you use this dataset for your work, please cite the paper below:

		  
@inproceedings{chen2017synonym,
	title={Unsupervised Software-Specific Morphological Forms Inference from Informal Discussions},
	author={Chen, Chunyang and Xing, Zhenchang and Wang, Ximing},
	booktitle={The 39th International Conference on Software Engineering, Buenos Aires, Argentina},
	year={2017},
	organization={IEEE}
	}