Patterns of Markup use in Wikipedia
Wikipedia is a knowledge building community that lets anyone create and edit articles. While editing articles, users employ visual structure elements (VSE) to format content. VSEs are part of the Wikipedia markup language. All creation and editing events are recorded in a revision history. An unsupe...
Autores principales: | , , |
---|---|
Formato: | Objeto de conferencia |
Lenguaje: | Inglés |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/155554 |
Aporte de: |
id |
I19-R120-10915-155554 |
---|---|
record_format |
dspace |
spelling |
I19-R120-10915-1555542023-07-13T20:08:32Z http://sedici.unlp.edu.ar/handle/10915/155554 isbn:978-1-5386-3483-7 Patterns of Markup use in Wikipedia Martin, Jonathan Torres, Diego Fernández, Alejandro 2017-10 2017 2023-07-13T17:52:15Z en Ciencias Informáticas Pattern mining Machine learning Unsupervised learning Wikipedia Wikipedia is a knowledge building community that lets anyone create and edit articles. While editing articles, users employ visual structure elements (VSE) to format content. VSEs are part of the Wikipedia markup language. All creation and editing events are recorded in a revision history. An unsupervised learning approach was used to analyze a dataset with more than 2,000,000 revisions of 126,000 articles. Using K-Means clustering and association rules mining a general classification of revisions was derived. Relevant classes include vandalism revisions, correction revisions and common revisions. Each class was later studied, and patterns of usage of markups elements identified. Those results help to identify the user intention, and the knowledge of VSE use could contribute to improving the actual text editors provide by Wikipedia to improve the editor’s activity finally. Laboratorio de Investigación y Formación en Informática Avanzada Objeto de conferencia Objeto de conferencia http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) application/pdf |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas Pattern mining Machine learning Unsupervised learning Wikipedia |
spellingShingle |
Ciencias Informáticas Pattern mining Machine learning Unsupervised learning Wikipedia Martin, Jonathan Torres, Diego Fernández, Alejandro Patterns of Markup use in Wikipedia |
topic_facet |
Ciencias Informáticas Pattern mining Machine learning Unsupervised learning Wikipedia |
description |
Wikipedia is a knowledge building community that lets anyone create and edit articles. While editing articles, users employ visual structure elements (VSE) to format content. VSEs are part of the Wikipedia markup language. All creation and editing events are recorded in a revision history. An unsupervised learning approach was used to analyze a dataset with more than 2,000,000 revisions of 126,000 articles. Using K-Means clustering and association rules mining a general classification of revisions was derived. Relevant classes include vandalism revisions, correction revisions and common revisions. Each class was later studied, and patterns of usage of markups elements identified. Those results help to identify the user intention, and the knowledge of VSE use could contribute to improving the actual text editors provide by Wikipedia to improve the editor’s activity finally. |
format |
Objeto de conferencia Objeto de conferencia |
author |
Martin, Jonathan Torres, Diego Fernández, Alejandro |
author_facet |
Martin, Jonathan Torres, Diego Fernández, Alejandro |
author_sort |
Martin, Jonathan |
title |
Patterns of Markup use in Wikipedia |
title_short |
Patterns of Markup use in Wikipedia |
title_full |
Patterns of Markup use in Wikipedia |
title_fullStr |
Patterns of Markup use in Wikipedia |
title_full_unstemmed |
Patterns of Markup use in Wikipedia |
title_sort |
patterns of markup use in wikipedia |
publishDate |
2017 |
url |
http://sedici.unlp.edu.ar/handle/10915/155554 |
work_keys_str_mv |
AT martinjonathan patternsofmarkupuseinwikipedia AT torresdiego patternsofmarkupuseinwikipedia AT fernandezalejandro patternsofmarkupuseinwikipedia |
_version_ |
1771439110706167808 |