Terminology, corpora and language resources

Terminology, corpora and language resources

This research area covers a range of projects within terminology and corpus linguistics, applied linguistics and specialised communication. Our research pertains especially to topics relevant for economics, business administration and law.

We have established a unique environment for research in specialised communication in Norway, the Nordic region and Europe.

We are active in developing monolingual and multilingual language resources and utilising them in empirical research within specialised communication. Most of these resources are available to the public for a variety of purposes. 

The research comprises both theoretical and applied approaches and utilises a broad spectrum of language resources: 
  • Research on strategic and political communication
  • Research on specialised translation
  • Statistical approaches to textual data analysis
  • Research on word and term formation, borrowing and neology, and linguistic creativity
  • Research on communication about migration and poverty
  • Popular science and news research
  • The Terminology Portal – a national infrastructure for terminology in Norway

 

Ongoing research projects

  • Clarino and Termportalen

    Clarino and Termportalen

    CLARINO (Common Language Resources and Technology Infrastructure: Norway) creates a national infrastructure for language resources in Norway. The project is funded by the Research Council of Norway and linked to the Europe-wide CLARIN project. We are developing Termportalen – a national portal for terminology resources. This effort answers to the need for coordination of terminological efforts at national and international levels. 

    For more information:

    Termportalen på Uib.no

    Clarino

  • Clarin

    Clarin

    CLARIN (Common Language Resources and Technology Infrastructure) is a Europe-wide research infrastructure that aims to provide easy and sustainable access, for scholars in the humanities and social sciences, to digital language data and advanced tools, independent of where they are located. We are a member of the CLARIN network through our active involvement in the CLARA and CLARINO projects.

    For more informaton:

    CLARIN

  • NHH Termbase

    NHH Termbase

    NHH TERMBASE is a terminological database covering a wide range of concepts within economics and business administration and related areas. It consists of freely available and updated terminology in Norwegian and English. 

    For more information: 

    NHH termbase

  • ISO/TC 37 and SN/K 144 LANGUAGE AND TERMINOLOGY

    ISO/TC 37 and SN/K 144 LANGUAGE AND TERMINOLOGY

    ISO/TC 37 and SN/K 144 LANGUAGE AND TERMINOLOGY 

    We are a member of the ISO Technical Committee 37 “Language and Terminology” and its Norwegian mirror committee SN/K 144 “Språk og terminologi”. The committees contribute in standardisation of descriptions, resources, technologies and services related to terminology, translation, interpreting and other language-based activities in the multilingual information society.

    For more information:

    ISO/TC 27

    SN/K 144

  • Maritim Ordbok

    Maritim Ordbok

    MARITIM ORDBOK is an externally funded project that develops a comprehensive terminology for maritime domains. The resource contains a wide range of concepts in Norwegian, English and other languages and covers all maritime areas including fauna, flora, marine industries, products, tools and equipment.

    For more information: 

    MARITIM ORDBOK

  • The Forskning.no corpus

    The Forskning.no corpus

    The Forskning.no corpus is a large Norwegian corpus of popular science from all areas collected from the news service Forskning.no. It is a valuable resource for research on popular science as a genre, for the study of terminology and language for specific purposes, as well as for applications such as term extraction, ontology building, etc. The corpus is accessible via the Corpuscle infrastructure in CLARINO (login required).

    Fot more information:

    THE FORSKNING.NO CORPUS

  • GLAD - GLOBAL ANGLICISM DATABASE NETWORK

    GLAD - GLOBAL ANGLICISM DATABASE NETWORK

    GLAD - GLOBAL ANGLICISM DATABASE NETWORK is an international network of scholars working on linguistic and cultural Anglicization within and across language communities worldwide. Among its specific objectives are producing an online global database of Anglicisms and investigating theoretical issues related to language contact with English and related phenomena.

    For more information:

    GLAD

  • THE FOMC CORPUS

    THE FOMC CORPUS

    The Federal Open Market Committee (FOMC) Corpus facilitates research on the deliberations on monetary policy in the United States from meeting session of the board of the Fed. It consists of transcripts of the FOMC’s meetings from 2002 onwards. It is accessible via the Sketch Engine (login required).

    For more information:

    THE FOMC CORPUS

  • Migration and the Media

    Migration and the Media

    MIGRATION AND THE MEDIA is a corpus-based research project which studies how media outlets, particularly newspapers, linguistically frame migrants and migration. The automated retrieval of linguistic features allows for a reliable analysis of semantic and pragmatic properties assigned to migrants across different newspaper articles. One of the main aims of this project is to ascertain whether migrants are presented in a way that influences their process of integration into their host societies. 

  • NO-JU TERMBASE

    NO-JU TERMBASE

    The NO-JU TERMBASE is a bilingual termbase covering a wide range of legal concepts from the Norwegian legal system and their counterparts in the German legal system. The freely available and updated terminology facilitates translation between Norwegian and German in legal domains by providing definitions, relevant context, synonyms and near-synonyms.

Completed research projects

  • Mikroøkonomen

    Mikroøkonomen

    The Mikroøkonomen terminology project developed a termbase consisting of 800 term records in the field of microeconomics (English-Norwegian).

    This has become a useful resource for students, lecturers and researchers of economics to bridge the gap between the literature written in English and the users’ need to acquire equal competence in Norwegian. Mikroøkonomen has since been further developed in the project NHH Termbase.

    The Mikroøkonomen project (completed 2008) received funding from the Research Council of Norway and NHH.

    For more information:
    Mikroøkonomen

  • KB-N KNOWLEDGE BANK OF NORWAY

    KB-N KNOWLEDGE BANK OF NORWAY

    The KB-N project developed a concept-oriented text and term based knowledge management system for economic-administrative domains.

    The project included language technology applications for use primarily within translation, documentation and publishing. It incorporated a 30-million-words parallel corpus of Norwegian and English economic-administrative texts from about 30 subdomains, as well as a bilingual termbase of some 8,400 term records. As part of the project an automatic term extractor module for Norwegian was developed.

    The 3-year KB-N project (completed 2006) was funded by the Research Council of Norway within its KUNSTI programme for language technology.

  • CLARA

    CLARA

    CLARA (Common languange resources and their applications) was an EU-funded project that carried out research on theoretical, methodological and technical topics relating to the task of harmonising language resources and terminology for professional domains.

    Professional knowledge domains, such as economy, energy and medicine, present special challenges to correct understanding, especially across languages.

    In this project a large corpus of English and Spanish Free Trade Agreements was compiled and experiments were performed with semi-automatic extraction of term candidates and specialised collocation candidates.

    The work focused specifically on parallel corpora, computational terminology and phraseology.

    For further information: 
    CLARA