Liang, S;
              
      
            
                Yilmaz, E;
              
      
            
                Shen, H;
              
      
            
                De Rijke, M;
              
      
            
                Croft, WB;
              
      
        
        
  
(2017)
  Search Result Diversification in Short Text Streams.
ACM Transactions on Information Systems
, 36
       (1)
    
    
    
    , Article 8.     10.1145/3057282.
  
  
       
    
  
| Preview | Text Yilmaz_a8-liang_accepted manuscript.pdf - Accepted Version Download (989kB) | Preview | 
Abstract
We consider the problem of search result diversification for streams of short texts. Diversifying search results in short text streams is more challenging than in the case of long documents, as it is difficult to capture the latent topics of short documents. To capture the changes of topics and the probabilities of documents for a given query at a specific time in a short text stream, we propose a dynamic Dirichlet multinomial mixture topic model, called D2M3, as well as a Gibbs sampling algorithm for the inference. We also propose a streaming diversification algorithm, SDA, that integrates the information captured by D2M3 with our proposed modified version of the PM-2 (Proportionality-based diversification Method -- second version) diversification algorithm. We conduct experiments on a Twitter dataset and find that SDA statistically significantly outperforms state-of-the-art non-streaming retrieval methods, plain streaming retrieval methods, as well as streaming diversification methods that use other dynamic topic models.
| Type: | Article | 
|---|---|
| Title: | Search Result Diversification in Short Text Streams | 
| Open access status: | An open access version is available from UCL Discovery | 
| DOI: | 10.1145/3057282 | 
| Publisher version: | http://dx.doi.org/10.1145/3057282 | 
| Language: | English | 
| Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. | 
| Keywords: | Science & Technology, Technology, Computer Science, Information Systems, Computer Science, Diversity, ad hoc retrieval, data streams | 
| UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science | 
| URI: | https://discovery.ucl.ac.uk/id/eprint/10045775 | 
Archive Staff Only
|  | View Item | 
 
                      
