My experience with MTurk is that three runs aren't enough if you need the data to be correct and can't afford to pay someone (who ISN'T from MTurk) to validate every entry.
We regularly ran into these two situations:
- All three workers got different answers
- Two of the three workers agreed on the wrong answer
I think five or more runs may be necessary for data transcription on MTurk.
You should consider using worker qualifications and/or simplifying the requests.
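As a rough illustration of why five runs help, here's a minimal sketch (Python, with hypothetical values) of taking the majority answer across redundant transcriptions and flagging anything without a clear consensus for manual review. With only three workers, two agreeing on the wrong answer wins; with five workers and an agreement threshold of three, a wrong pair is much less likely to carry the vote.

```python
from collections import Counter

def consensus(answers, min_agreement=3):
    """Return the majority answer across redundant transcriptions,
    or None if no answer reaches the agreement threshold."""
    if not answers:
        return None
    value, count = Counter(answers).most_common(1)[0]
    return value if count >= min_agreement else None

# Example: five workers transcribed the same field (hypothetical data)
transcriptions = ["1234 Elm St", "1234 Elm St", "1234 Elm St.",
                  "1234 Elm St", "1284 Elm St"]
result = consensus(transcriptions)
if result is None:
    print("No consensus -- send for manual review")
else:
    print("Accepted:", result)
```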
The error rate I get for data entry tasks is around a 0.5%-1% discrepancy between double entries. If you use the worker's prior reliability to tie-break who's right, it drops to a <0.1% error rate.
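A minimal sketch of what that tie-break might look like (the reliability score here is a hypothetical per-worker accuracy estimate, e.g. historical agreement with gold-standard items, not anything MTurk provides directly):

```python
def resolve_double_entry(entry_a, entry_b, reliability_a, reliability_b):
    """Compare two independent transcriptions of the same field.
    On agreement, accept the value; on disagreement, fall back to
    whichever worker has the better historical accuracy."""
    if entry_a == entry_b:
        return entry_a, "agreement"
    winner = entry_a if reliability_a >= reliability_b else entry_b
    return winner, "tie-break"

# Hypothetical example: two workers disagree on an amount field
value, how = resolve_double_entry("$1,050.00", "$1,060.00", 0.992, 0.974)
print(value, how)  # -> $1,050.00 tie-break
```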
I mean, this is an issue in any annotation exercise. Most annotation work heads south due to a failure to create an entire, discrete, and complete workflow/classification.