Graduate Programs Portal (UnB)

SIGAA - Integrated System for Academic Activities Management (Sistema Integrado de Gestão de Atividades Acadêmicas)

PPGEST
Graduate Program in Statistics (Programa de Pós-Graduação em Estatística)
Institute of Exact Sciences (Instituto de Ciências Exatas)
Phone/Extension: Unavailable
E-mail: Unavailable
https://www.unb.br/pos-graduacao

QUALIFYING Examination Committee: Lucas José Gonçalves Freitas

A MASTER'S QUALIFYING examination committee has been registered by the program.
STUDENT: Lucas José Gonçalves Freitas
DATE: 24/10/2022
TIME: 16:00
LOCATION: Sala Multiuso EST (A1-76/7)
TITLE:

Text clustering applied to imbalanced legal data.

KEYWORDS:

UN 2030 Agenda; Machine Learning; Text Clustering and Classification.

PAGES: 58
MAJOR AREA: Exact and Earth Sciences
AREA: Probability and Statistics
SUMMARY:

The Federal Supreme Court (STF), the highest court in the Brazilian judicial system, produces, like courts at other levels, an immense amount of textual data in the form of decisions, petitions, injunctions, appeals and other legal documents. These documents are classified and grouped by public employees who specialize in cataloging court cases and who, in specific situations, use technological support tools. Some STF cases, for example, are classified under one or more of the Sustainable Development Goals (SDGs) of the United Nations (UN) 2030 Agenda. Because this is a repetitive task based on pattern recognition, machine learning tools can be developed for it. This work proposes Natural Language Processing (NLP) models for clustering cases, in order to enlarge the database for SDGs that naturally have few entries. Clustering, which is of great importance in its own right, can also gather unlabeled entries around cases already classified by court staff, allowing new labels to be assigned to similar cases. Preliminary results show that cluster-augmented sets can be used in supervised learning pipelines to aid the classification of legal texts, especially in contexts with imbalanced data.
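
The idea of gathering unlabeled entries around already classified cases can be illustrated with a minimal sketch. This is not the method used in the work, which the summary does not detail; it is a toy seeded clustering over bag-of-words vectors, written with the Python standard library only. The function names (vectorize, cosine, centroid, augment_labels), the similarity threshold, and the example texts are all hypothetical and chosen purely for illustration.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term counts for a lowercased, whitespace-split text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Sum of count vectors; cosine ignores scale, so no averaging is needed."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return total


def augment_labels(labeled, unlabeled, threshold=0.3, iterations=5):
    """Seeded clustering: one cluster per known label, seeded by labeled texts.

    labeled:   list of (text, label) pairs (hypothetical toy input)
    unlabeled: list of texts
    Returns (text, label) pairs for the unlabeled texts that joined a cluster.
    The threshold value is an assumption, not taken from the source.
    """
    seeds = {}
    for text, label in labeled:
        seeds.setdefault(label, []).append(vectorize(text))
    vectors = [vectorize(t) for t in unlabeled]
    centroids = {label: centroid(vs) for label, vs in seeds.items()}
    assignment = [None] * len(vectors)
    for _ in range(iterations):
        new_assignment = []
        for v in vectors:
            # Keep a text unlabeled unless some centroid reaches the threshold.
            best_label, best_sim = None, threshold
            for label, c in centroids.items():
                sim = cosine(v, c)
                if sim >= best_sim:
                    best_label, best_sim = label, sim
            new_assignment.append(best_label)
        if new_assignment == assignment:
            break  # assignments are stable, so the centroids would not change
        assignment = new_assignment
        # Recompute each centroid from its seeds plus its newly assigned texts.
        centroids = {
            label: centroid(vs + [v for v, a in zip(vectors, assignment) if a == label])
            for label, vs in seeds.items()
        }
    return [(t, a) for t, a in zip(unlabeled, assignment) if a is not None]


# Toy data: two labeled cases and three unlabeled ones (all invented).
labeled = [
    ("clean water sanitation river pollution", "SDG 6"),
    ("gender equality women discrimination", "SDG 5"),
]
unlabeled = [
    "river pollution affects clean water supply",
    "women workplace discrimination claim",
    "tax dispute over import duties",
]
augmented = augment_labels(labeled, unlabeled)
```

In this toy run the first two unlabeled texts join the "SDG 6" and "SDG 5" clusters, while the tax dispute stays unlabeled because it shares no terms with either seed. The augmented pairs could then be added to the training set of a supervised classifier, which is the role the summary assigns to cluster-augmented sets.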

COMMITTEE MEMBERS:
Chair - 1953590 - THAIS CARVALHO VALADARES RODRIGUES
Internal member - 1465423 - ANDRE LUIZ FERNANDES CANCADO
Internal member - 3000020 - GUILHERME SOUZA RODRIGUES

Notice registered on: 20/10/2022 10:34