Abstract: Data gathered from social media have been used extensively to examine lexical dialect variation in widely used languages such as English and Spanish, but their use to date in morphosyntax and for lesser-used languages has been more limited. This paper tests the usefulness of using data derived from Twitter to address traditional questions in dialect syntax and sociolinguistics. It uses two cases studies from Welsh – the form of the second-person singular pronoun in various syntactic contexts, and the availability of auxiliary deletion – to ...
(read more)
Topics: 
Linguistics
Natural language processing