Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sponsored posts not filtered out #342

Open
LinqLover opened this issue Aug 26, 2020 · 0 comments
Open

Sponsored posts not filtered out #342

LinqLover opened this issue Aug 26, 2020 · 0 comments

Comments

@LinqLover
Copy link
Contributor

We are scraping tweets by a certain query on a regular basis and storing the results into a database. Recently we found a rather small number of tweets not matching our query we pass to the twitterscraper in any way neither being a response to a relevant post. These tweets look like sponsored posts that possibly were not filtered out correctly.

However, while we have been running the twitterscraper every day for ~9 months since now and scraped nearly 5000 tweets, there were only ca. 16 unmatching tweets in our database, so the problem appears to occur highly sporadically. Some, but not all of tweets are also available in a backup from 2020-08-01, so the problem might be a bit older but still persists.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant