dc.contributor.author | Chen, Maomao | |
dc.contributor.author | Huang, Maoyi | |
dc.date.accessioned | 2016-06-27T11:52:26Z | |
dc.date.available | 2016-06-27T11:52:26Z | |
dc.date.issued | 2016-06-27 | |
dc.identifier.uri | http://hdl.handle.net/2077/44663 | |
dc.description.abstract | Identifying the topic of an article can involve a lot of manual work, and the manual
process becomes exhausting when applied to a large volume of articles. To tackle this problem, we
propose an automated topic extraction approach that can extract topics from a large
number of articles efficiently. Our research builds on existing N-gram analysis,
which only counts how frequently words appear in a document. We extend it with
customized filtering standards to improve efficiency and to eliminate as many
irrelevant or noncritical phrases as possible. In this way, the keyphrases finally
selected for each article serve as unique labels that represent the core idea of that
specific article. We focus on research papers within the autonomous vehicle domain
because such papers are in high demand in our daily life. Since most research
papers are available only in PDF format, we first convert the PDF files into an
editable file type such as TXT. To realize the automation, we selected a large
number of autonomous vehicle-related articles to test our proposed approach. We then
compare the results with those of manual topic extraction to evaluate our approach. | sv |
dc.language.iso | eng | sv |
dc.subject | automatic topic extraction | sv |
dc.subject | N-gram | sv |
dc.subject | keyphrase | sv |
dc.subject | frequency statistic | sv |
dc.title | Automatic Topic Extraction from Research Articles Using N-gram Analysis | sv |
dc.type | text | |
dc.setspec.uppsok | Technology | |
dc.type.uppsok | M2 | |
dc.contributor.department | Göteborgs universitet/Institutionen för data- och informationsteknik | swe |
dc.contributor.department | University of Gothenburg/Department of Computer Science and Engineering | eng |
dc.type.degree | Student essay | |