Please use this identifier to cite or link to this item: https://rd.uffs.edu.br/handle/prefix/5001
Type: Scientific Article
Title: A new strategy to seed selection for the high recall task
Author: Todescato, Matheus Vinícius
First advisor: Dal Bianco, Guilherme
Abstract: High Recall Information Retrieval (HIRE) aims at identifying all (or nearly all) relevant documents given a query. HIRE is used, for example, in the systematic literature review task, where the goal is to identify all relevant scientific articles. The documents that HIRE selects as relevant determine the effort the user must spend to find the target information. One of the goals of HIRE is therefore to return only relevant documents, avoiding overburdening the user with non-relevant information. Traditionally, supervised machine learning algorithms (e.g., SVM) are used as the core of HIRE to produce a ranking of relevant documents. However, such algorithms depend on an initial training set (seed) to start the learning process. In this work, we propose a new approach to produce the initial seed for HIRE, focused on reducing the user effort. Our approach combines active learning with a ranking strategy to select only the informative examples. The experiments show that our approach reduces the labeling effort by up to 18% while maintaining competitive recall.
Keywords: Information
Learning
Document
Information retrieval
Computer science
Language: por
Country: Brazil
Publisher: Universidade Federal da Fronteira Sul
Acronym of the institution: UFFS
College, Institute or Department: Campus Chapecó
Type of Access: Open Access
URI: https://rd.uffs.edu.br/handle/prefix/5001
Publication date: 12-May-2021
Appears in collections: Ciência da Computação

Files in this item:
File: TODESCATO.pdf
Size: 908.86 kB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.