
Entropy, Vol. 21, Pages 1201: Entropy Rate Estimation for English via a Large Cognitive Experiment Using Mechanical Turk (Entropy)

 
 

7 December 2019 01:00:12

 
 


The entropy rate h of a natural language quantifies the complexity underlying the language. Recent studies have estimated this rate computationally, but their results depend fundamentally on the performance of the language model used for prediction. In 1951, by contrast, Shannon conducted a cognitive experiment to estimate the rate without any such artifact. Shannon’s experiment, however, used only one subject, which calls into question the statistical validity of his value of h = 1.3 bits per character for the entropy rate of English. In this study, we conducted Shannon’s experiment on a much larger scale via Amazon’s Mechanical Turk, a crowd-sourcing service, to reevaluate the entropy rate h. Each online subject recruited through Mechanical Turk was given the preceding characters of a text and asked to guess the next character repeatedly until the guess was correct. We collected 172,954 character predictions and analyzed them with a bootstrap technique. The analysis suggests that a large number of character predictions per context length, perhaps as many as 10^3, would be necessary to obtain a convergent estimate of the entropy rate, and that with fewer predictions the resulting h value may be underestimated. Our final entropy estimate was h ≈ 1.22 bits per character.
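The guessing procedure and the bootstrap analysis described above can be illustrated with a minimal sketch. It assumes the estimate is based on Shannon's 1951 upper bound, that is, the entropy of the empirical distribution of the number of guesses needed to reach the correct character. The abstract does not say which estimator or extrapolation the study actually used, so the function names `upper_bound_entropy` and `bootstrap_entropy`, the resample count, and the toy data are all hypothetical.

```python
import math
import random
from collections import Counter


def upper_bound_entropy(guess_counts):
    """Shannon's (1951) upper bound in bits per character: the entropy of the
    empirical distribution of the number of guesses needed per character."""
    n = len(guess_counts)
    freq = Counter(guess_counts)
    return -sum((c / n) * math.log2(c / n) for c in freq.values())


def bootstrap_entropy(guess_counts, n_resamples=1000, seed=0):
    """Resample the guess counts with replacement and return the mean and
    sample standard deviation of the upper-bound estimate across resamples."""
    rng = random.Random(seed)
    n = len(guess_counts)
    estimates = [
        upper_bound_entropy(rng.choices(guess_counts, k=n))
        for _ in range(n_resamples)
    ]
    mean = sum(estimates) / n_resamples
    var = sum((e - mean) ** 2 for e in estimates) / (n_resamples - 1)
    return mean, math.sqrt(var)


# Hypothetical toy data (not from the study): number of guesses each
# prediction took, where 1 means the first guess was correct.
toy_guesses = [1] * 8 + [2] * 4 + [3] * 2 + [5, 9]

print(upper_bound_entropy(toy_guesses))  # 1.875
print(bootstrap_entropy(toy_guesses))    # (bootstrap mean, bootstrap std. dev.)
```

With so few samples, the plug-in entropy estimate tends to be biased low, which is in line with the abstract's remark that too few predictions per context length may underestimate h.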


 
Category: Informatics, Physics
 
 
 

