	 STS Benchmark: Main English dataset
	 
	      (Translated to Swedish!)
		    
    Semantic Textual Similarity 2012-2017 Dataset

	    http://ixa2.si.ehu.eus/stswiki

Task: Given two sentences, s1 and s2, a system must compute how similar they are and return a similarity score between 0 and 5. The dataset comprises naturally occurring sentence pairs drawn from several domains and genres, annotated through crowdsourcing. See the papers by Agirre et al. (2012; 2013; 2014; 2015; 2016; 2017).
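
To make the task interface concrete, here is a minimal sketch of a toy baseline that maps a sentence pair to a score in the 0 to 5 range. It is not part of the dataset or of any official tooling: the function name `similarity_score` and the token-overlap (Jaccard) heuristic are assumptions chosen purely for illustration, and a real system would use a far stronger model.

```python
def similarity_score(s1: str, s2: str) -> float:
    """Toy baseline (hypothetical): Jaccard token overlap scaled to [0, 5]."""
    # Lowercase and split on whitespace; no real tokenization is attempted.
    tokens1 = set(s1.lower().split())
    tokens2 = set(s2.lower().split())
    union = tokens1 | tokens2
    if not union:
        # Two empty sentences are treated as identical.
        return 5.0
    # Share of tokens common to both sentences, rescaled to the 0-5 range.
    return 5.0 * len(tokens1 & tokens2) / len(union)


if __name__ == "__main__":
    # Illustrative Swedish pair, not taken from the dataset.
    print(similarity_score("En man spelar gitarr", "En man spelar piano"))
```

The two example sentences share three of five distinct tokens, so the toy score falls in the middle of the range. Word overlap alone misses paraphrases and synonyms, which is exactly what the human-annotated scores are meant to capture.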

Version 1.0 (2020/09/07):

Translated using Google's NMT API (no human correction of potential translation errors).