When a tweet embeds another tweet, a picture, or a similar attachment, the tweet text and the attachment link are not separated. See this example:
https://twitter.com/ACM_CHIIR/status/837247495864479744
Hooray - the proceedings arrived today! #chiir2017pic.twitter.com/lLZMfTpR0F
This is unsatisfying in itself, and it also breaks hashtag detection.
If, as in the example, the last token before the attachment is a hashtag, tweet.getHashtags() returns #chiir2017pic as a hashtag, where it should simply be #chiir2017.
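
A possible workaround on the caller's side, sketched here in Python and independent of the library: insert a space before an attachment link that is fused to the preceding token, then extract hashtags from the repaired text. The helper names separate_attachments and extract_hashtags are hypothetical, and the assumption that fused attachments always start with pic.twitter.com/ or http(s):// rests only on the example above.

```python
import re

# Zero-width match at the boundary between a non-space character and an
# attachment link. Assumption: attachments start with one of these prefixes.
_GLUED_LINK = re.compile(r"(?<=\S)(?=pic\.twitter\.com/|https?://)")

# Simplified hashtag pattern, for illustration only.
_HASHTAG = re.compile(r"#\w+")


def separate_attachments(text):
    """Insert a space before an attachment link fused to the preceding token."""
    return _GLUED_LINK.sub(" ", text)


def extract_hashtags(text):
    """Return hashtags found after the attachment link has been split off."""
    return _HASHTAG.findall(separate_attachments(text))


raw = "Hooray - the proceedings arrived today! #chiir2017pic.twitter.com/lLZMfTpR0F"
print(separate_attachments(raw))
print(extract_hashtags(raw))
```

This heuristic would also split a word that happens to end in "pic" directly before ".twitter.com/", so separating text and attachment at the point where the tweet is parsed would be the more robust fix.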