A Semantic Graph-Based Approach for Mining Common Topics From Multiple Asynchronous Text Streams

Long Chen, Joemon M Jose, Haitao Yu, Fajie Yuan

Research output: Chapter in Book/Report/Conference proceedingConference proceedingpeer-review

3 Citations (Scopus)

Abstract

In the age of Web 2.0, a substantial amount of unstructured content are distributed through multiple text streams in an asynchronous fashion, which makes it increasingly difficult to glean and distill useful information. An effective way to explore the information in text streams is topic modelling, which can further facilitate other applications such as search, information browsing, and pattern mining. In this paper, we propose a semantic graph based topic modelling approach for structuring asynchronous text streams. Our model integrates topic mining and time synchronization, two core modules for addressing the problem, into a unified model. Specifically, for handling the lexical gap issues, we use global semantic graphs of each timestamp for capturing the hidden interaction among entities from all the text streams. For dealing with the sources asynchronism problem, local semantic graphs are employed to discover similar topics of different entities that can be potentially separated by time gaps. Our experiment on two real-world datasets shows that the proposed model significantly outperforms the existing ones.
Original languageEnglish
Title of host publicationWWW '17: Proceedings of the 26th International Conference on World Wide Web
PublisherACM
Pages1201-1209
Number of pages9
ISBN (Print)9781450349130
DOIs
Publication statusPublished - 3 Apr 2017
Externally publishedYes
Event26th International Conference on World Wide Web - Perth, Australia
Duration: 3 Apr 20177 Apr 2017

Conference

Conference26th International Conference on World Wide Web
Abbreviated titleWWW'17'
Country/TerritoryAustralia
CityPerth
Period3/04/177/04/17

Fingerprint

Dive into the research topics of 'A Semantic Graph-Based Approach for Mining Common Topics From Multiple Asynchronous Text Streams'. Together they form a unique fingerprint.

Cite this