PHILLY:New In The City

07/30/2012: Mark-Up Tags

                We put of the first batch on Mechanical Turk. We didn't know how long it would take or how many people would be interested in taking this survey so we made a batch of 200 leads and wanted 10 responses to each one. The batch was complete within a day. We were not expecting so many people to take the survey and have it completed within 24 hours. This was a great relief because we thought we were asking too much questions and that there was too much work for too little money but that wasn't the case.

                We weren't expecting such a fast response so we didn't have any stats on the data yet. It will be interesting to see and hopefully something good will come out of it.

                On a different note. I got some translation mark-ups back from Ani! I have my one stuff to work on now. She only got through the first category. Noun phrased non-literal but still it's something to work on. She found so far that most of phrases aren't translating into Bulgarian. There are a couple that do but some of them are only getting the literal translation and not the literal.

                Ani created some mark-up categories to help label different types of translations errors:

T1---translated well, into an a literal expression that is not  word for word translation

T2---translated well into non-literal expression that is a word-for word translation

T3---garbled translation of the marked phrase

T4---garbled translation of  an unmarked phrase

T5---funny but understandable translation

T6---error related to incorrect word sense

T7---left untranslated 

So Ani did more than just mark if they were translated or not which was nice because it might be useful to have more information about the mark up than not. What is most important is that the labels T1, T2, and T5 correctly translated the non-literal phrase and T3 meant that it was not translated to the non-literal phrase.

                Yes, on a few occasions I may have missed a non-literal phrase and that is what T4 meant. However the majority of these phrases I marked as other when I did my markings. I did mark them, I just didn't know what category to put them in.

                I also got some translation mark up from Annie in Tamil. She found that most of the non-literal phrases didn't translate but even in the beginning it was different from language to language and it was based a lot on culture. That is where these phrases begin, right? This language was a little different because right off the bat Annie said that because the sentence structure is so different between Tamil and English the Google translations were as helpful as they were to Bulgarian.