Skip to content

Incorrect splitting for separating hashtags #31

@neumannm

Description

@neumannm

In the case where there is another tweet, a picture or similar embedded in a tweet, the text of the tweet and this attachment are not separated. See this example:
https://twitter.com/ACM_CHIIR/status/837247495864479744
Hooray - the proceedings arrived today! #chiir2017pic.twitter.com/lLZMfTpR0F

This is by itself unsatisfying, but also leads to problems in the detection of a hashtag.

If like in the example, the last token of a tweet before the attachment is a hashtag, tweet.getHashtags() will output #chiir2017pic as a hashtag, where it should simply be #chiir2017 .

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions