Romanian Google N-grams Filtering Tool
This Perl tool automatically filters the Romanian n-grams from the Google Web 1T N-grams Corpus, 10 European Languages, Version 1. The output n-grams are in Romanian, with missing diacritics restored.