wiki:WikiStart

Welcome to the CLARIN-NL Wiki

In this section, generic information on Clarin topics can be found. Currently, collections of links to relevant documentation are presented with a short description of its content. This information is presented in chapters that reflect the division in Clarin topics made elsewhere:

The Clarin-NL helpdesk also has a Frequently Asked Questions section. Any requests for changes or additions can be submitted by e-mailing the Clarin-NL helpdesk.

Metadata


http://www.clarin.eu/files/wg2-4-metadata-doc-v5.pdf
Title: Metadata Infrastructure for Language Resources and Technology
Date/Version?: 2009-02-04 - Version 5
Content: This document gives an overview about how metadata descriptions are used until now, what the deficits of the current infrastructures are and which lessons we as community learned from about a decade of experience. Based on this the requirements for a new CLARIN approach are being worked out. This document will be discussed in the appropriate working groups and in the Executive Board. It will be subject of regular adaptations dependent on the progress in CLARIN.

http://trac.clarin.nl/trac/attachment/wiki/WikiStart/BestPracticeGuide-V4.pdf
Title: Best Practice Guide for using CLARIN metadata components
Date/Version?: 2010
Content: The Dutch CLARIN project “Creating and using CLARIN metadata components” was the first to actually test the use of components and to try to create metadata descriptions for resources available in two Dutch language resource centers: the Institute for Dutch Lexicology (INL) and the Meertens Institute. This “Best Practice Guide” is the result of this project. It will however in the future be extended with new experiences gained by new projects that will make use of the CMDI.

http://www.clarin.eu/files/metadata-CLARIN-ShortGuide.pdf
Title: Component Metadata
Date/Version?: 2009-02
Content: In this A4 shortguide, some introductory information is provided on Component Metadata following the generic shortguide layout of "What is it?", "What is it for?", "Who can use it?", "When can it be used?" and "How does it work?".

http://www.clarin.nl/system/files/clarin-md-component.pdf
Title: Metadata and DCR
Date/Version?: 2010-03-25
Content: In this presentation given at an ISOcat workshop in Utrecht, shortcomings were discussed in "traditional metadata" and benefits of "component metadata". To guarantee interoperability while using different components, data categories are discussed. Furthermore, a bigger picture is provided through some diagrams and finally, an overview of building and using components in practice is provided.

http://www.clarin.nl/system/files/LREC2010_Metadataproject_FdV-v1.1.ppt
Title: Creating & Testing CLARIN Metadata Components - A CLARIN-NL project
Date/Version?: 2010-05-18
Content: This presentation was given at LREC in Malta and featured the topics "What is CMDI?", "What is the goal of our project?", "How to go from a resource to harvestable metadata?" and "Findings of the project and future challenges". This presentation has an accompanying text document that can be found here.

Arbil

Arbil can be found at http://www.lat-mpi.eu/tools/arbil

http://www.mpi.nl/corpus/a4guides/a4-guide-arbil.pdf
Title: ARBIL
Date/Version?: 2009-11
Content: This A4 guide features practical information on: "1. Starting ARBIL", "2. Getting your metadata"", "3. Changing your metadata"and "4. Saving and exporting your metadata".

http://www.mpi.nl/corpus/manuals/manual-arbil_ug.pdf
Title: Arbil User Guide
Date/Version?: 2010-02-17
Content: This extensive user guide contains user oriented information on Arbil and its usage, featuring lots of details and screenshots.

http://www.mpi.nl/corpus/manuals/manual-arbil.pdf
Title: Arbil for creating IMDI-corpora metadata
Date/Version?: 2009-12
Content: This is the official Arbil manual, featuring lots of details and screenshots.

http://www.clarin.nl/system/files/ArbilClarin.pdf
Title: ARBIL, the CMDI metadata editor
Date/Version?: 2010-05-26
Content: In this document, information on Arbil and its features is provided and visualized through screenshots. In more detail, the following topics are addressed in this document: "generic information on Arbil and its history", "a description of Arbil as an XML editor", "the specialized functions of Arbil", "profiles in Arbil", "the construction of metadata files", "entering and organizing data", "searching and visualizing the data", ''the Arbil forum'' and "installing Arbil".

http://www.mpi.nl/tg/j2se/jnlp/linorg/ArbilPresentation20091015.pdf
Title: Arbil Presentation
Date/Version?: 2009-10-15
Content: This is a nice presentation of Arbil, featuring 20 sheets of information and many screenshots.

Component Registry

The Component Registry can be found at http://www.clarin.eu/cmdi

http://www.clarin.eu/system/files/cmdi_isocat_Paper.pdf
Title: A Data Category Registry- and Component-based Metadata Framework
Date/Version?: 2010-05-19
Content: We describe our computer-supported framework to overcome the rule of metadata schism. It combines the use of controlled vocabularies, managed by a data category registry, with a component-based approach, where the categories can be combined to yield complex metadata structures. A metadata scheme devised in this way will thus be grounded in its use of categories. Schema designers will profit from existing prefabricated larger building blocks, motivating re-use at a larger scale. The common base of any two metadata schemes within this framework will solve, at least to a good extent, the semantic interoperability problem, and consequently, further promote systematic use of metadata for existing resources and tools to be shared.

http://www.clarin.eu/system/files/ComponentRegistryReferenceManual.pdf
Title: Component Registry and Browser Reference Manual.
Date/Version?: Unknown
Content: In this manual, features of the Component Registry are highlighted, providing information and screenshots of them. The following features are covered in this manner: "1) Register and store CMDI Components/Profiles?", "2) Enable a user to browse the registered Components/Profiles?" and "3) Enable a user to edit and create Components/Profiles?".

http://www.clarin.nl/system/files/CLARIN_ComponentRegistry.pdf
Title: CMDI Component Registry
Date/Version?: 2010
Content: This is a presentation of the Component Registry, full of visual information about its characteristics.

Go Up

ISOcat

ISOcat can be found at http://www.isocat.org

http://www.isocat.org/files/manual.html
Title: Manual
Date/Version?: This page is updated regularly
Content: This page contains links to guides on how to use ISOcat and the Data Category Registry in general, as they gradually come available. It is a very good place to find information about ISOcat.

http://www.clarin.eu/files/concept_registry-CLARIN-ShortGuide.pdf
Title: Concept Registry Service
Date/Version?: 2009-02
Content: In this A4 shortguide, some introductory information is provided on the Concept Registry Service following the generic shortguide layout of "What is it?", "What is it for?", "Who can use it?", "When can it be used?" and "How does it work?".

http://pubman.mpdl.mpg.de/pubman/item/escidoc:131099:4/component/escidoc:131101/Kemps_Snijders_ISOcat_IMSO_2009.pdf
Title: ISOcat: remodelling metadata for language resources
Date/Version?: 2009
Content: Abstract: The Max Planck Institute for Psycholinguistics in Nijmegen, The Netherlands, is creating a state-of-the-art web environment for the ISO TC 37 (terminology and other language and content resources) metadata registry. This Data Category Registry (DCR) is called ISOcat and encompasses data categories for a broad range of language resources. Under the governance of the DCR Board, ISOcat provides an open work space for creating data category specifications, defining Data Category Selections (DCSs) (domain-specific groups of data categories), and standardising selected data categories and DCSs. Designers visualise future interactivity among the DCR, reference registries and ontological knowledge spaces.

http://www.clarin.nl/system/files/ISOcat-20100208.pdf
Title: ISOcat A short introduction
Date/Version?: 2010-02-08
Content: This is a presentation given at a Clarin-NL meeting. Among other things also presented in the next mentioned presentation, it features information on the status of ISOcat at that moment in time.

http://www.clarin.nl/system/files/ISOcat-introduction.pdf
Title: ISO 12620 Data Category Registry An introduction
Date/Version?: 2010-03-25
Content: In this presentation given at an ISOcat workshop in Utrecht, information was offered about standardization, data categories and their models.

http://www.clarin.eu/system/files/cmdi_isocat_Paper.pdf
Title: A Data Category Registry- and Component-based Metadata Framework
Date/Version?: 2010-05-19
Content: We describe our computer-supported framework to overcome the rule of metadata schism. It combines the use of controlled vocabularies, managed by a data category registry, with a component-based approach, where the categories can be combined to yield complex metadata structures. A metadata scheme devised in this way will thus be grounded in its use of categories. Schema designers will profit from existing prefabricated larger building blocks, motivating re-use at a larger scale. The common base of any two metadata schemes within this framework will solve, at least to a good extent, the semantic interoperability problem, and consequently, further promote systematic use of metadata for existing resources and tools to be shared.

Go Up

PIDs

http://www.pidconsortium.eu/
Title: EPIC home
Date/Version?: This website is regularly maintained
Content: Since the beginning of 2009 GWDG runs on behalf of the Max Planck Society a PID service, based on the handle system (TM, http://www.handle.net/ ), for the allocation and resolution of persistent identifiers. Together with other European partners a consortium was build to provide this services to the European research community.

http://www.clarin.eu/files/pid-CLARIN-ShortGuide.pdf
Title: Persistent Identifier Service
Date/Version?: 2009-02
Content: In this A4 shortguide, some introductory information is provided on the Persistent Identifier Service following the generic shortguide layout of "What is it?", "What is it for?", "Who can use it?", "When can it be used?" and "How does it work?".

http://www-sk.let.uu.nl/u/D2R-2b.pdf
Title: Federation Foundation Persistent and unique Identifiers
Date/Version?: 2009-02-04 - Version 5
Content: This document describes the goals and requirements of a registration and resolution system for persistent and unique resource identifiers that could be used by all CLARIN members and beyond, i.e. a functioning system could be used by other communities as well and there is great interest. Stepwise all CLARIN centers would need to introduce PIDs to come to a proper landscape of resources where various instances can and will be created at various places. This document will be discussed in the appropriate working groups and in the Executive Board. It will be subject of regular adaptations dependent on the progress in CLARIN.

http://www.clarin.nl/system/files/eric-pid-clarin-info-day.pdf
Title: Persistent Identifiers for Language Resources
Date/Version?: 2009-06
Content: This presentation was given at the CLARIN Info Day. It discusses the questions "Why the Handle System?" and "What does it mean?" and gives examples.

Go Up

Webservices

http://www-sk.let.uu.nl/u/D2R-6b.pdf
Title: Requirements Specification Web Services and Workflow Systems
Date/Version?: 2010-01-12 - Version 2
Content: This document describes the goals and requirements of web services and workflow systems that could be used by all CLARIN members and beyond, i.e. a functioning system could be used by other communities as well. Stepwise all CLARIN centers would need to introduce these requirements in their operational environment to come to a proper landscape of resources, services and tools where various instances can and will be created/operated at various places. This document will be discussed in the appropriate working groups and in the Executive Board. It will be subject of regular adaptations dependent on the progress in CLARIN.

http://www.clarin.eu/system/files/SOA-CLARIN-ShortGuide.pdf
Title: Service Oriented Infrastructure
Date/Version?: 2010-02
Content: In this A4 shortguide, some introductory information is provided on the Service Oriented Infrastructure following the generic shortguide layout of "What is it?", "What is it for?", "Who can use it?", "When can it be used?" and "How does it work?".

http://www.clarin.eu/system/files/ws_interop-CLARIN-ShortGuide.pdf
Title: Web Services Interoperability
Date/Version?: 2010-02
Content: In this A4 shortguide, some introductory information is provided on Web Services Interoperability following the generic shortguide layout.

http://www.clarin.nl/system/files/marc-CLARIN-NL.pptx
Title: CLARIN web services and workflow
Date/Version?: unknown
Content: This presentation covers topics like the registration and workflow of webservices, the architecture of webservices and formats and the process of standardization.

http://ilk.uvt.nl/clam/
Content: This webpage features information on CLAM (Computational Linguistics Application Mediator). CLAM allows you to quickly and transparently transform your Natural Language Processing application into a RESTful webservice, with which both human end-users as well as automated clients can interact.

Go Up

Formats and standards

http://www.iso.org/iso/standards_development/processes_and_procedures/stages_description.htm
Title: Stages of the development of International Standards
Date/Version?: This website is maintained regularly
Content: This official ISO website describes the stages involved in standardization.

http://www.clarin.nl/system/files/Standards%20for%20LRT-v6.pdf
Title: Standards for LRT
Date/Version?: 2009-01
Content: This document is the basis for a joint web-site with recommendations for CLARIN. Each known name of a standard or best-practice guideline will be commented along a few criteria

http://www.clarin.eu/files/standards-CLARIN-ShortGuide.pdf
Title: Standards and Best Practices
Date/Version?: 2009-02
Content: In this A4 shortguide, some introductory information is provided on Standards and Best Practices following the generic shortguide layout.

http://www.clarin.eu/files/standards-text-CLARIN-ShortGuide.pdf
Title: Standards for Text Encoding
Date/Version?: 2009-05
Content: In this A4 shortguide, some introductory information is provided on Standards for Text Encoding following the generic shortguide layout.

http://www.clarin.nl/system/files/CLARIN-NL%20formats.pptx
Title: Formats, interoperability and standards
Date/Version?: unknown
Content: This presentation covers format interoperability, the process of standardization, pivot formats and community practices.

Go Up

AAI

http://www-sk.let.uu.nl/u/D2R-2a.pdf
Title: Federation Foundation for Language Resource and Technology
Date/Version?: 2009-02-04 - Version 7
Content: This document describes the requirements for the Language Resource and Technology Federation that CLARIN wants to build up based on a stable network of centers as described in CLARIN-1-2008 and CLARIN- 3-2008. It is also referring to a detailed discussion of possible solutions for persistent identifiers as described in CLARIN-2/2008. This document will be discussed in the appropriate working groups and in the Executive Board. It will be subject of regular adaptations dependent on the progress in CLARIN. In chapter 1 it is explained why federation technology is an issue for a research infrastructure as CLARIN. In chapter 2 we will discuss various models of federations, distinguish identity and service provider federations and describe a few pillars federations need to have. In chapter 3 we will describe the technologies required to implement a CLARIN federation and in chapter 4 the major middleware components are introduced to establish a distributed authentication and authorization domain. In chapter 5 we summarize the requirements relevant for CLARIN and in chapter 6 we outline the procedural approach.

http://www.clarin.eu/system/files/CLARIN_Service_Provider_Start-Up_Federation_Agreement_Final.pdf
Title: CLARIN SERVICE PROVIDER FEDERATION START-UP AGREEMENT
Date/Version?: unknown
Content: This is the formal start-up agreement for the Clarin service provider federation.

http://www.clarin.nl/system/files/federation-dieter.ppt
Title: Authorization and Authentication Infrastructure
Date/Version?: 2009-07-01
Content: This presentation was held at the CLARIN-NL Info Session in Nijmegen. It features "CLARIN and the holy grail", "Traditional Federations", "AAI prototype" and "Planning".

https://spaces.internet2.edu/display/SHIB2/Home
Title: Shibboleth home
Date/Version?: This website is updated regularly
Content: This is the homepage of Shibboleth 2, a community-maintained repository for deployment, configuration, and production information. Shibboleth allows users to securely send trusted information about themselves to remote resources. This information may then be used for authentication, authorization, content personalization, and enabling single sign-on across a broad range of services from many different providers.

http://www.switch.ch/aai/index.html
Title: SWITCHaai – the key that connects students and the university
Date/Version?: This website is updated regularly
Content: This is a web page where information on AAI is offered in several layers of complexity. Very useful here are the simple, medium and expert demo that are offered here

http://aai.kuleuven.be/
Title: AAI > Shibboleth
Date/Version?: This website is updated regularly
Content: This is a web page in Dutch, where information on AAI is offered. In one of its sub pages (here), a nice diagram is presented to visualize the process of accessing a shibbolized service. Some of this information in English can be found in a presentationhere.

CLARIN-compatible

http://trac.clarin.nl/trac/attachment/wiki/WikiStart/CLARIN%20compatible%20NL%20110805.pdf

This document defines the notion 'CLARIN-compatible' (in Dutch).

The English version is here: https://trac.clarin.nl/attachment/wiki/WikiStart/CLARIN%20compatible%20EN%20131015.pdf

Go Up

Last modified 4 years ago Last modified on 10/17/2013 09:09:48 AM

Attachments (3)

Download all attachments as: .zip