Georgetown University LogoGeorgetown University Library LogoDigitalGeorgetown Home
    • Login
    View Item 
    •   DigitalGeorgetown Home
    • Georgetown University Institutional Repository
    • Georgetown College
    • Department of Computer Science
    • Faculty Scholarship - Computer Science Department
    • View Item
    •   DigitalGeorgetown Home
    • Georgetown University Institutional Repository
    • Georgetown College
    • Department of Computer Science
    • Faculty Scholarship - Computer Science Department
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Membership Detection Using Cooperative Mining

    Cover for Membership Detection Using Cooperative Mining
    View/Open
    View/Open: Singh_MembershipDetection.pdf (949kB) Bookview

    Creator
    Newport, Calvin (Computer scientist)
    Singh, Lisa
    Ren, Yiqing
    Abstract
    More and more companies are providing data mining and analytics solutions to customers using social media data. The general approach taken by these companies is to continually collect data from social media sites and then use the collected snapshot of the content for a data mining or analytics task. Unfortunately, given the exponential increase in the volume of social media data, building local database snapshots and running computationally expensive algorithms is not always plausible. As an alternative to the centralized approach, in this paper, we study the feasibility of cooperative algorithms where data never leaves the mined social media network, and instead the network users themselves work together, using only the communication primitives provided by the social media site, to solve data mining problems. While cooperative algorithms can be built for many different data mining tasks, to show the viability of this approach, we focus on a task fundamental to many different social mining applications - membership detection (an individual using the social media site wants to efficiently get a request to a member of a known group with unknown membership). Using Twitter as our specific social graph, we seek cooperative algorithms that solve this problem with high probability even when we assume only a small fraction of the Twitter network participates and we enforce a bound on the number of tweets generated. After validating the potential of cooperative solutions on Twitter, we empirically evaluate a collection of cooperative strategies on a snapshot of the Twitter network containing over 50 million users. Our best solution, which we call brokered token passing, can reliably and efficiently detect group membership while requiring only a small number of tweets be sent and a small percentage of users participate.
    Permanent Link
    http://hdl.handle.net/10822/761530
    Date Published
    2014
    Subject
    Computer Science; Data mining; Social media;
    Type
    text
    Collections
    • Faculty Scholarship - Computer Science Department
    Metadata
    Show full item record

    Related items

    Showing items related by title, author, creator and subject.

    • Cover for Exploring graph mining approaches for dynamic heterogeneous networks

      Exploring graph mining approaches for dynamic heterogeneous networks 

      Singh, Lisa (2007)
      As we become a more 'connected' society, a greater need exists to understand complex network structures. While many in the field of data mining analyze network data, most models of networks are straightforward-focusing on ...
    Related Items in Google Scholar

    Georgetown University Seal
    ©2009 - 2023 Georgetown University Library
    37th & O Streets NW
    Washington DC 20057-1174
    202.687.7385
    digitalscholarship@georgetown.edu
    Accessibility
     

     

    Browse

    All of DigitalGeorgetownCommunities & CollectionsCreatorsTitlesBy Creation DateThis CollectionCreatorsTitlesBy Creation Date

    My Account

    Login

    Statistics

    View Usage Statistics

    Georgetown University Seal
    ©2009 - 2023 Georgetown University Library
    37th & O Streets NW
    Washington DC 20057-1174
    202.687.7385
    digitalscholarship@georgetown.edu
    Accessibility