Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Move files.opentreeoflife.org to S3 #126

Open
jar398 opened this issue Feb 21, 2017 · 6 comments
Open

Move files.opentreeoflife.org to S3 #126

jar398 opened this issue Feb 21, 2017 · 6 comments

Comments

@jar398
Copy link
Member

jar398 commented Feb 21, 2017

The site is currently on a surplus server (varela.csail.mit.edu) that works well for the time being and is free, but if it fails in any way, recovery is not guaranteed.

I estimated S3 hosting costs at the highest service level and it comes to about $ $.28 per month for 12G.

Information on hosting web sites on S3: https://aws.amazon.com/blogs/aws/host-your-static-website-on-amazon-s3/

S3 is not the only choice, but we already have an Amazon account so it should be straightforward.

@jar398
Copy link
Member Author

jar398 commented Mar 9, 2017

It looks like s3-hosted web sites do not support directory indexes. I find the indexes to be quite useful because they allow one to browse to see what's available on the server. E.g.

http://files.opentreeoflife.org/ott/
http://files.opentreeoflife.org/synthesis/
http://files.opentreeoflife.org/ncbi/
http://files.opentreeoflife.org/fung/
etc.

One might use this to answer questions like:

  • Do I have the latest draft of the OTT version that is under development? or
  • Is the unpacked output/ directory available for the previous version of the synthetic tree?

Going to s3 would require either foregoing this feature, or implementing it ourselves via some kind of script (that would write index.html files, etc.). The latter sounds fragile to me, but maybe it could be made to work.

@jar398
Copy link
Member Author

jar398 commented Mar 9, 2017

Another issue is that there is no server for running 'tar' on. You have to unpack locally and then send the files using 'aws s3 sync' or equivalent. For synthesis version 8.0 this is more than 50,000 files. I have no idea how long that would take. Maybe I will try it later today...

@jar398
Copy link
Member Author

jar398 commented Mar 9, 2017

Also, there seems to be no way to make symbolic links, breaking our 'current' links for synth and taxonomy (they would have to be maintained on a different server)

@jar398
Copy link
Member Author

jar398 commented Mar 9, 2017

@jar398
Copy link
Member Author

jar398 commented Mar 10, 2017

spoke too soon about listings, too.
https://github.com/rufuspollock/s3-bucket-listing
http://opentreeoflife-files.s3-website-us-west-2.amazonaws.com/ott/ott3.0/
This solution requires putting a special index.html in every directory.

@jar398
Copy link
Member Author

jar398 commented Mar 17, 2017

S3 pricing: https://aws.amazon.com/s3/pricing/
Data transfer is $.01 per Gb, so basically nothing in our case (the site is 11G, so 11 cents to grab the whole site, all versions of everything). There is a per-request fee of $0.005 per 1,000 PUT requests (less for GET). To transfer the 50,000 files belonging to a single synthesis release (if we decide to continue hosting the output/ trees on the files site) would be about 25 cents. We are not talking large numbers.

This is a simplification - prices vary depending on various plan parameters. But I don't think the numbers would be much higher than this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant