This book describes methodological and technological approaches to corpus building and presents research based on the "Norwegian Newspaper Corpus". It gives an overview of the corpus and its system architecture, and presents tools for tasks such as text harvesting, annotation, and topic classification and extraction.