Working with ANNIS has been a new experience for me, as I haven’t closely worked with corpus searches before. The interface was a bit overwhelming and confusing at first, but with some visual instructions it was easier to navigate the…
The Proper Noun Problem – Working with ANNIS
Working with the software ANNIS and the corpus consisting of our novels’ example sentences, one of the predictions we had at the beginning of term came true: Machine annotation is indeed faulty and there is a tendency to classify non-English…
Working with ANNIS on multilingual Sentences – Experience and Analysis
Set up and first impressions I found working with ANNIS both fun and insightful. I especially appreciated that query results displayed annotations and sentence dependencies immediately, unlike in Google Collab, where multiple intermediate steps were needed. This intuitive interface made…
Working with ANNIS for the First Time
Working with our corpus and ANNIS was an interesting experience because it gave us the first overview of how our corpus looks and how we can use it to make deductions about our non-English novels as a whole. Playing around…
ANNIS – Much Potential, Hindered by Machine Annotation
Experience Using ANNIS After getting used to the surface, I found ANNIS to be very user-friendly. The Query-Builder allows for a combination of all kinds of prompts. In theory, this could be used to find out how many non-English words…
Analysing multilingual sentences with ANNIS
For me, working with ANNIS was much more fun than annotating sentences in Google Collab. I liked actually being able to get some quantifications out of the sentences we annotated. Although in the end the corpus we uploaded on ANNIS…
Using ANNIS as a Tool in Analysing Multilingual Sentences
To me, using ANNIS to study and interpret the annotations of our collective corpus was fun. But only after figuring out the limits and possibilities that diffeI focused on one particular area. It was possible to discover different patterns in…
ANNIS or Google Collab?
Well, they both are terribly bad at recognizing „foreign“ words (which basically means, any non-English words) in Multilingual structures where different languages are combined to create sentences. Honestly, I expected ANNIS to do a better job at annotating the multilingual…
Our corpus
Parts of speech Our corpus consists of 681 non-English and 3405 English words, meaning 4086 words in total. Here are some of the distributions as they were classified by the machine: Part of speech Total amount of part of speech…
Interpretation and Analysis (with reference to example sentences from Owuor’s novel Dust)
In class, we developed the hypothesis that the inclusion of nouns from a foreign language into a post-monolingual text might in general be more likely than the inclusion of other parts of speech. Our reasoning was that we deem their…