Menu
Mon panier

En cours de chargement...

Recherche avancée

Building and Exploring Web Corpora - Proceedings of the 3rd web as corpus workshop, incorporating cleaneval (Broché)

Edition en anglais

Cédrick Fairon, Hubert Naets, Adam Kilgarriff, Gilles-Maurice de Schryver

Collectif

  • Presses Universitaires Louvain

  • Paru le : 01/01/2007
WAC More and more people are using Web data for linguistic and NLP research. The Web as Corpusworkshop (WAC) provides a venue for exploring how we can... > Lire la suite
  • Plus d'un million de livres disponibles
  • Retrait gratuit en magasin
  • Livraison à domicile sous 24h/48h*
    * si livre disponible en stock, livraison payante
19,70 €
Expédié sous 8 à 17 jours
  • ou
    À retirer gratuitement en magasin U
    entre le 2 décembre et le 11 décembre
WAC More and more people are using Web data for linguistic and NLP research. The Web as Corpusworkshop (WAC) provides a venue for exploring how we can use it effectively and the advancementsto which this could lead. This book is a collection of the talks presented at the 3 rd WAC in Louvain-la-Neuve (Belgium). The focus is on the description of Web corpus collection projects, the exploration of Web datacharacteristics from a linguistics/NLP perspective, and on the use of crawled Web data for NLPpurposes.
CLEANEVAL Any use of Web data requires that it be cleaned in order to get rid of unwanted material including, for example, HTML markup, navigation bars, advertisements. To date there has been no sharingof resources or expertise in this particular domain and the cleaning has often been done minimally. Cleaneval was an exercise aimed at promoting collaboration and improving our understandingof the issues.
Results and perspectives are presented in this book.

Fiche technique

  • Date de parution : 01/01/2007
  • Editeur : Presses Universitaires Louvain
  • Collection : Cahiers du CENTAL
  • ISBN : 978-2-87463-082-8
  • EAN : 9782874630828
  • Présentation : Broché
  • Nb. de pages : 182 pages
  • Poids : 0.51 Kg
  • Dimensions : 16,0 cm × 24,0 cm × 1,0 cm

À propos des auteurs

Cédrick Fairon est professeur à l'Université catholique de Louvain où il dirige le Centre de traitement automatisé du langage (CENTAL).

Building and Exploring Web Corpora - Proceedings of the 3rd web as corpus workshop, incorporating cleaneval est également présent dans les rayons

Cédrick Fairon et Hubert Naets - Building and Exploring Web Corpora - Proceedings of the 3rd web as corpus workshop, incorporating cleaneval.
Building and Exploring Web Corpora. Proceedings of the...
19,70 €
Haut de page