Patterns of Markup use in Wikipedia

Wikipedia is a knowledge building community that lets anyone create and edit articles. While editing articles, users employ visual structure elements (VSE) to format content. VSEs are part of the Wikipedia markup language. All creation and editing events are recorded in a revision history. An unsupe...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, Jonathan, Torres, Diego, Fernández, Alejandro
Formato: Objeto de conferencia
Lenguaje:Inglés
Publicado: 2017
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/155554
Aporte de:
id I19-R120-10915-155554
record_format dspace
spelling I19-R120-10915-1555542023-07-13T20:08:32Z http://sedici.unlp.edu.ar/handle/10915/155554 isbn:978-1-5386-3483-7 Patterns of Markup use in Wikipedia Martin, Jonathan Torres, Diego Fernández, Alejandro 2017-10 2017 2023-07-13T17:52:15Z en Ciencias Informáticas Pattern mining Machine learning Unsupervised learning Wikipedia Wikipedia is a knowledge building community that lets anyone create and edit articles. While editing articles, users employ visual structure elements (VSE) to format content. VSEs are part of the Wikipedia markup language. All creation and editing events are recorded in a revision history. An unsupervised learning approach was used to analyze a dataset with more than 2,000,000 revisions of 126,000 articles. Using K-Means clustering and association rules mining a general classification of revisions was derived. Relevant classes include vandalism revisions, correction revisions and common revisions. Each class was later studied, and patterns of usage of markups elements identified. Those results help to identify the user intention, and the knowledge of VSE use could contribute to improving the actual text editors provide by Wikipedia to improve the editor’s activity finally. Laboratorio de Investigación y Formación en Informática Avanzada Objeto de conferencia Objeto de conferencia http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) application/pdf
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language Inglés
topic Ciencias Informáticas
Pattern mining
Machine learning
Unsupervised learning
Wikipedia
spellingShingle Ciencias Informáticas
Pattern mining
Machine learning
Unsupervised learning
Wikipedia
Martin, Jonathan
Torres, Diego
Fernández, Alejandro
Patterns of Markup use in Wikipedia
topic_facet Ciencias Informáticas
Pattern mining
Machine learning
Unsupervised learning
Wikipedia
description Wikipedia is a knowledge building community that lets anyone create and edit articles. While editing articles, users employ visual structure elements (VSE) to format content. VSEs are part of the Wikipedia markup language. All creation and editing events are recorded in a revision history. An unsupervised learning approach was used to analyze a dataset with more than 2,000,000 revisions of 126,000 articles. Using K-Means clustering and association rules mining a general classification of revisions was derived. Relevant classes include vandalism revisions, correction revisions and common revisions. Each class was later studied, and patterns of usage of markups elements identified. Those results help to identify the user intention, and the knowledge of VSE use could contribute to improving the actual text editors provide by Wikipedia to improve the editor’s activity finally.
format Objeto de conferencia
Objeto de conferencia
author Martin, Jonathan
Torres, Diego
Fernández, Alejandro
author_facet Martin, Jonathan
Torres, Diego
Fernández, Alejandro
author_sort Martin, Jonathan
title Patterns of Markup use in Wikipedia
title_short Patterns of Markup use in Wikipedia
title_full Patterns of Markup use in Wikipedia
title_fullStr Patterns of Markup use in Wikipedia
title_full_unstemmed Patterns of Markup use in Wikipedia
title_sort patterns of markup use in wikipedia
publishDate 2017
url http://sedici.unlp.edu.ar/handle/10915/155554
work_keys_str_mv AT martinjonathan patternsofmarkupuseinwikipedia
AT torresdiego patternsofmarkupuseinwikipedia
AT fernandezalejandro patternsofmarkupuseinwikipedia
_version_ 1771439110706167808