Conditional random fields in text segmentation by language

Robin Cabeza Ruiz

doi:10.18046/syt.v15i43.2712

Authors

Robin Cabeza Ruiz University of Holguín

DOI:

https://doi.org/10.18046/syt.v15i43.2712

Keywords:

Text segmentation by language, conditional random fields.

Abstract

This work presents the use of conditional random fields to solve the task of text segmentation by language, treating it as a sequence tagging task. A language change is assumed to be possible at any point in the text; the observations are the words of the text, and the states are the different languages. The research leads to the conclusion that conditional random fields are a powerful tool for segmenting multilingual text.
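
The formulation above (words as observations, languages as states, a possible language change at every word) can be illustrated with a minimal sketch. It is not the author's implementation and not a trained conditional random field: it only shows Viterbi decoding over a linear chain, using hand-set scores in place of learned feature weights. The language set, the word lists, the score values, and the function names are all hypothetical and chosen for illustration.

```python
# Toy linear-chain decoder: words are observations, languages are states.
# All lexicons and scores below are hypothetical, hand-set for illustration;
# a real CRF would learn its feature weights from labelled text.

LANGS = ["en", "es"]

# Hypothetical word lists standing in for learned emission features.
LEXICON = {
    "en": {"the", "house", "is", "big", "and"},
    "es": {"la", "casa", "es", "grande", "y"},
}


def emission(word, lang):
    """Score for tagging `word` with `lang`: 1.0 if it is in the toy lexicon."""
    return 1.0 if word.lower() in LEXICON[lang] else 0.0


def transition(prev_lang, lang):
    """Small reward for staying in the same language; a switch is allowed at any word."""
    return 0.5 if prev_lang == lang else 0.0


def viterbi(words):
    """Return the highest-scoring language tag sequence for `words`."""
    if not words:
        return []
    # best[lang] = (score of the best path ending in lang, that path)
    best = {lang: (emission(words[0], lang), [lang]) for lang in LANGS}
    for word in words[1:]:
        step = {}
        for lang in LANGS:
            # Pick the predecessor state that maximizes the path score into `lang`.
            prev = max(LANGS, key=lambda p: best[p][0] + transition(p, lang))
            score = best[prev][0] + transition(prev, lang) + emission(word, lang)
            step[lang] = (score, best[prev][1] + [lang])
        best = step
    return max(best.values(), key=lambda item: item[0])[1]


def segments(words, tags):
    """Group consecutive words that share a language tag into segments."""
    out = []
    for word, tag in zip(words, tags):
        if out and out[-1][0] == tag:
            out[-1][1].append(word)
        else:
            out.append((tag, [word]))
    return out


words = "the house is big la casa es grande".split()
tags = viterbi(words)
print(segments(words, tags))
```

With these toy scores, the example sentence is split into one English segment followed by one Spanish segment. In a trained model, the emission and transition scores would be replaced by weighted feature functions estimated from annotated multilingual text.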

Author Biography

Robin Cabeza Ruiz, University of Holguín

Master's degree in Computer-Aided Design from the Universidad de Holguín (Cuba, 2015) and bachelor's degree in Computer Science from the Universidad de Oriente (Cuba, 2017). He is currently a professor of Informatics II and a member of the CAD/CAM Studies Center at the Faculty of Engineering of the Universidad de Holguín. His main research interests are biomechanics and computer-based text segmentation.
