You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Given a list of tweet-IDs from Twitter, tools such as https://github.com/docnow/twarc makes it easy to extract the tweets. Using this to extract image URL's seems straight forward, so it should be possible to automate the generation of an image collage given just the list of tweet-IDs. By keeping the map of IDs<->images, a link back to the originating tweet on Twitter can be used for metadata.
Things to consider:
If multiple tweets points to the same image, should the image be shown once for each tweet or just once in total? The former can lead to hundreds of thousands of repetitions of the same image (see https://medium.com/on-archivy/exploring-womensmarch-dcc30221101c), while the latter "hides" the image among lesser-shared images and makes it problematic to provide links back to the originating tweets.
Should the images be downloaded before collage creation or fetched on the fly by juxta? This is tied to Spaces in filenames does not work #1 as repeat images would be fetched over the net once for each repetition. Also, the current version of juxta is not geared towards fetching from the web and will be effectively blocked by an adversary image server that trickle-serves images one byte at a time.
Given this, the best solution seems to #1 repeat images in the collage and #2 download them before generating the collage.
The text was updated successfully, but these errors were encountered:
This has been implemented in demo_twitter.sh, with the trade-offs as described above. I am not really sure that the best option is to repeat the duplicate images though. Maybe some sort of visual prioritization mechanism instead? Sorting by popularity? Changing border color? Permanent boxes around certain images?
Related to this, the de-duplication of images would only catch those with the same URL. This could be improved with checksums, but with re-sizing and re-compression of images on social media, even that does not guarantee that duplicates will be eliminated.
Given a list of tweet-IDs from Twitter, tools such as https://github.com/docnow/twarc makes it easy to extract the tweets. Using this to extract image URL's seems straight forward, so it should be possible to automate the generation of an image collage given just the list of tweet-IDs. By keeping the map of IDs<->images, a link back to the originating tweet on Twitter can be used for metadata.
Things to consider:
Given this, the best solution seems to #1 repeat images in the collage and #2 download them before generating the collage.
The text was updated successfully, but these errors were encountered: