Data on Jupyterhub is not syncing

ljchang · September 29, 2020, 8:58pm

Each student runs a virtual server that is accessed via jupyterhub. Our jupyterhub server is managed by @arnsong from Research Computing. Each server mounts an S3 bucket that will copy over data and notebooks from a central cloud storage S3 bucket. Every time you launch your server, data should synchronize with the central storage and copy over any changes. However, sometimes this process doesn’t complete and you will not have the most up to date files. So far this seem to be more likely to occur whenever there is a big change in data (i.e., we upload a large dataset to the central storage).

How do I sync my jupyterhub server?

ljchang · September 29, 2020, 8:59pm

If you are finding that you are missing files or do not have access to the most up to date notebooks, then you will need to manually run the startup script in your jupyterhub server via a terminal.

Open a new terminal by clicking the New menu and selecting Terminal

2020-09-29_16-53-471752×485 52.1 KB
Run the /usr/local/share/startup.sh script by pasting the command and hitting enter. This should start syncing data between the central S3 bucket and your local storage.

2020-09-29_16-55-031731×520 32.4 KB

Topic		Replies	Views
How do I get access to data and code if I'm not a Dartmouth Student? DartBrains.org	0	640	July 13, 2021
How do I run a jupyter notebook server on discovery? Discovery Questions	2	1164	March 1, 2022
Requirements.txt DartBrains.org	3	580	July 15, 2021
S3 AU data timepoints CompSAN Data Competition	1	338	March 28, 2022
Data format for Tutorials DartBrains.org	3	830	September 15, 2020

Data on Jupyterhub is not syncing

Related topics