-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Large put request fails with connection reset #311
Comments
Thank you for opening your first issue in this project! Engagement like this is essential for open source projects! 🤗 |
Any help here please? |
Sorry for leaving you hanging. Can you share the logs from CHP and the backend server (ideally with configurable-http-proxy --log-level debug --timeout 300000 --proxy-timeout 300000 # 5 minute timeout for each There's probably a configuration parameter in node-http-proxy we need to expose. Including the traceback from CHP if there is one would help pin down what's needed, whether it's in the client configuration or server. |
Hi! Sorry for the huge delay. The logs shows that the proxy of content/ api but I couldn't find any error logs..
Thanx a lot! |
I'm unsure about configuration to solve your issue in CHP, but I wonder what evidence there is that CHP is to blame compared to another part in the network chain. Anyone that can reproduce this on another k8s cluster setup, such on on GKE or AKS would reduce the chances it is configuration in a AWS component managing incoming traffic. Perhaps you can write down more details about your entire setup? How is network traffic flowing? The JupyterHub Helm chart will let traffic go from (the autohttps pod running Traefik ->) the proxy pod running CHP -> the user pod when it comes to saving a notebook. Those could be at fault - but then there is components outside control of the JupyterHub Helm chart as well that could be at fault. Do we have a way to pinpoint the issue to what component is causing the trouble? |
What caused the suspicion in chp is: when we added a pod (classic notebook separated from the jupyter helm) to our cluster saving large notebooks works (taking more than 2 minutes, but work). when we added chp as a proxy to it, the issue reappeared. |
Ah that is a great test to pinpoint it to CHP @barperez111! Is it correct then that the network traffic has gone the same paths, but in one case it went directly to a user pod instead through CHP to the user pod - in both situations using the same other network infrastructure? |
Yes I believe that is correct. |
Hi @barperez111 I am running on a similar problem while trying to upload files >10MB, I have read the information about modify tornado websockets and body size and memory on jupyter notebook configuration but still facing the same issue for upload large files. One question, how do you modify the parameters timeout & proxy-timeout for Configurable-http-proxy? I mean this modification was via Jupyterhub config file? thanks for your help. |
Bug description
Hi! i encounter something that feels like a bug. it is related to what is talked about here.
Basically, we run z2jh on eks. when saving large notebooks the correspondent http put request sent by content manager fails with connection reset after =~ 1 min.
To check things, we ran a standalone notebook on the same cluster. using it, large files are saved just fine (after something like 2 minutes). that lead me thinking that the issue is chp related, so i added a chp before the stand alone notebook, and the issue appeared (getting connection reset after =~ 1 min ).
I tried setting --timeout and --proxy-timeout params but that didn't help.. log debug level didn't help me either.
any thoughts? are we sure timeout params working well?
Ill Appreciate any help whatsoever ,
thanx!
The text was updated successfully, but these errors were encountered: