Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.
Sign upTweetUtils - support fulltext #192
Comments
This comment has been minimized.
Show comment
Hide comment
This comment has been minimized.
greebie
Apr 6, 2018
Contributor
Going through the twitter ingestion script, it might be nice to include a few other things if possible from the api. Retweet_count for instance. Possibly things like "location" and "in_reply_to" depending on how far we want to go with that and ethical considerations.
Here's a short list from my perspective and maybe we can check the things we want to include, and then we can fix things all together:
- in_reply_to_status_id
- in_reply_to_user_id
- source (mobile or web interface)
- entities/user_mentions
- entities/urls
- entities/hashtags (could replace the hashtag extractor we have in aut)
- retweeted (true / false)
- place
- retweet_count
- retweet_status { provides another tweet object so maybe not }
- user/profile_image
- user/description
- user/time_zone
- user/location
- user/name
- user/url
This is just a rough list of possible candidates. Not necessarily suggesting we should ingest them all or even one. Just think it's worth looking at while we fix TweetUtils.
Going through the twitter ingestion script, it might be nice to include a few other things if possible from the api. Retweet_count for instance. Possibly things like "location" and "in_reply_to" depending on how far we want to go with that and ethical considerations. Here's a short list from my perspective and maybe we can check the things we want to include, and then we can fix things all together:
This is just a rough list of possible candidates. Not necessarily suggesting we should ingest them all or even one. Just think it's worth looking at while we fix TweetUtils. |
This comment has been minimized.
Show comment
Hide comment
This comment has been minimized.
@greebie sounds like a separate issue. Can you create one? |
ruebot commentedApr 6, 2018
Since the expanded tweet move in early 2017, and the move to 280 characters, the text of tweet will be found in
.full_text
instead of.text
.https://developer.twitter.com/en/docs/tweets/tweet-updates