Extração de informações de conferências em páginas web [Extraction of conference information from web pages]

by Cássio Alan Garcia

Institution: Universidade do Rio Grande do Sul
Department:
Degree:
Year: 2017
Keywords: Information Extraction; Databases; Information Retrieval; Conditional Random Fields; Web Services
Posted: 2/1/2018 12:00:00 AM
Record ID: 2152060
Full text PDF: http://hdl.handle.net/10183/170942


Abstract

Choosing the right conference for a paper depends on several factors: (i) the topic of the paper must be among the conference's topics of interest; (ii) the submission deadline must leave enough time to write the paper; (iii) the conference location and registration fees; and (iv) the quality of the conference (Qualis) as rated by CAPES. These factors, combined with the existence of thousands of conferences, make the search for a suitable event very time-consuming, especially when researching a new area. To help researchers find conferences, this work presents a method for collecting and extracting data from conference web sites. This is a challenging task, mainly because each conference has its own web site with a different layout. The proposed method, CONFTRACKER, combines the identification of the URLs of conferences listed in the Qualis Table with the extraction of deadlines from their sites. Extraction is carried out independently of the conference, the page layout, and the way dates are presented (formatting and labels). To evaluate the method, experiments were carried out with real web data from Computer Science conferences. The results show that CONFTRACKER performed significantly better than a baseline based on the position of labels and dates. Finally, the extraction process is run for all conferences in the Qualis Table, and the collected data populate a database that can be queried through an online interface.

Advisors/Committee Members: Moreira, Viviane Pereira.
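
The abstract mentions a baseline based on the position of labels and dates but gives no details of it. As a rough illustration only, a position-based heuristic of that general kind might look like the sketch below. Everything in it is an assumption made for illustration: the function name `nearest_date_to_label`, the label vocabulary, and the single supported date format ("March 15, 2017") do not come from the dissertation, and this is not the CONFTRACKER method itself.

```python
import re
from datetime import date

# Month names mapped to month numbers.
MONTHS = {name: number for number, name in enumerate(
    ["january", "february", "march", "april", "may", "june", "july",
     "august", "september", "october", "november", "december"], start=1)}

# Assumption: only dates written like "March 15, 2017" are recognized.
DATE_RE = re.compile(
    r"\b(" + "|".join(MONTHS) + r")\s+(\d{1,2}),\s*(\d{4})\b",
    re.IGNORECASE)

# Assumption: a tiny, hypothetical vocabulary of deadline labels.
LABEL_RE = re.compile(r"\b(submission|paper)\s+(deadline|due)\b", re.IGNORECASE)


def nearest_date_to_label(text):
    """Return the date closest (in characters) to the first deadline label.

    Returns None when the text has no recognized label or no valid date.
    """
    label = LABEL_RE.search(text)
    if label is None:
        return None
    best_gap, best_date = None, None
    for match in DATE_RE.finditer(text):
        try:
            candidate = date(int(match.group(3)),
                             MONTHS[match.group(1).lower()],
                             int(match.group(2)))
        except ValueError:
            continue  # skip impossible dates such as "February 31, 2017"
        # Character gap between the label span and the date span,
        # whichever side of the label the date is on.
        gap = max(match.start() - label.end(), label.start() - match.end())
        if best_gap is None or gap < best_gap:
            best_gap, best_date = gap, candidate
    return best_date


page_text = ("Important dates. Notification: June 1, 2017. "
             "Paper submission deadline: March 15, 2017. "
             "Camera-ready: July 10, 2017.")
print(nearest_date_to_label(page_text))
```

A heuristic like this depends entirely on the label and the date sitting next to each other in the text, which is easily broken by tables, multi-column layouts, or unexpected label wording. The abstract names that variability across conference sites as the main difficulty of the task.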
