Abstract: The dataset contains three collections of mainly literary Arabic texts from the 19th and early 20th centuries. corpus022_JurjiZaydan_Dated is a dated corpus of 22 historical novels by Jurj�� Zayd��n. It is well established that Jurj�� Zayd��n was publishing roughly one novel per year and the dates of publication are well known, which makes this corpus a valuable material for testing chronological changes in the style of individual writers. corpus065 is a corpus of 65 books by 8 authors; corpus300 contains 300 books by 28 authors...
(read more)