Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ERROR mpool push: failed to push message #12794

Closed
5 of 11 tasks
pk-controller opened this issue Dec 17, 2024 · 10 comments
Closed
5 of 11 tasks

ERROR mpool push: failed to push message #12794

pk-controller opened this issue Dec 17, 2024 · 10 comments
Labels
kind/bug Kind: Bug

Comments

@pk-controller
Copy link

Checklist

  • This is not a security-related bug/issue. If it is, please follow please follow the security policy.
  • I have searched on the issue tracker and the lotus forum, and there is no existing related issue or discussion.
  • I am running the Latest release, the most recent RC(release canadiate) for the upcoming release or the dev branch(master), or have an issue updating to any of these.
  • I did not make any code changes to lotus.

Lotus component

  • lotus daemon - chain sync
  • lotus fvm/fevm - Lotus FVM and FEVM interactions
  • lotus miner/worker - sealing
  • lotus miner - proving(WindowPoSt/WinningPoSt)
  • lotus JSON-RPC API
  • lotus message management (mpool)
  • Other

Lotus Version

Daemon: 1.31
lotus version 1.31.0

./lotus-shed send-csv --from <FIL source address> <filename>.csv

Repro Steps

  1. ./lotus-shed send-csv --from .csv
  2. It happens whenever running a batch send, batch approval, or batch proposal
  3. after 10 messages or so, this varies
  4. ERROR mpool push: failed to push message: failed to add locked: too many pending message for actor
    ...

Describe the Bug

So whenever we are sending a batch proposal, approval, or batch send. It will execute about 10 messages and then we get the error. We have seen this error now on 3 different computers.

Logging Information

N/A
@pk-controller pk-controller added the kind/bug Kind: Bug label Dec 17, 2024
@github-project-automation github-project-automation bot moved this to 📌 Triage in FilOz Dec 17, 2024
@rvagg
Copy link
Member

rvagg commented Dec 18, 2024

@pk-controller did this used to work but is now not working? I guess you're using a public API, like glif. The "untrusted" mpool push method has a limit of 10 messages at a time, to prevent spam attacks. A "trusted" path has 1000, but that would require not going through the gateway which imposes this limit, but running your own node or getting access to a node directly.

We could change send-csv to do smarter batching, or detect this error and adjust, but it would slow it down.

@jennijuju
Copy link
Member

thanks rod

We could change send-csv to do smarter batching, or detect this error and adjust, but it would slow it down.

I don’t think we would be able to prioritize this as this is not a main use case of lotus.

@pk-controller Id suggest you to consider to run your own node, or reach out to @ArseniiPetrovich from Protofire for node service/api services.

@pk-controller
Copy link
Author

pk-controller commented Dec 18, 2024

@rvagg yes this used to work no issue, for the last 4 years, we would run up to 300-400. it just changed to the 10 maybe a month or two ago. I have always been using a lite node, it may have changed, but not recently. I use the glif lite node:
FULLNODE_API_INFO=wss://wss.node.glif.io/apigw/lotus lotus daemon --lite

@ArseniiPetrovich
Copy link
Contributor

Hm, we haven't changed anything related to this on our end. The only thing is that without access key it is nowadays not possible to send over 100 requests per minute, but this was the case for many months now, if not for a year. Not sure how it was working on the 300-400 scale then at all.

@rvagg
Copy link
Member

rvagg commented Dec 18, 2024

Oh, I recall now. In #12431 I did a couple of things, the main thing was plugging a hole whereby eth_sendTransactionRaw was using an untrusted path so it was possible to spam a gateway with messages and not be limited, but I was also fixing a hole with MpoolPush that was completely ignoring the fact that a push should be "untrusted" in certain circumstances, thereby undoing the whole point of having a trusted/untrusted path split for gateway vs raw access. You can see this fix in chain/messagepool/messagepool.go. I didn't call attention to it in the PR but we had discussed this internally and agreed to ship the fix quietly.

So what you're now bumping in to is the proper, intended limitations of the original code, you just happened to be bypassing it and were being allowed to spam glif with your messages where you probably shouldn't. I think the right way to do this is to use an API key with glif and/or negotiate with @ArseniiPetrovich to get access to a raw managed node or some other kind of un-capped API.

@rjan90
Copy link
Contributor

rjan90 commented Jan 6, 2025

I think this issue can be closed now as it's working as intended. The limitation of 10 messages for untrusted MpoolPush was always meant to be there, but was accidentally being bypassed, but which has now been fixed in #12431.

For users needing to push larger batches of messages at once, the recommended solutions are:

  1. Get an API key from a RPC provider that has higher limits
  2. Work with provider to get managed node access
  3. Run your own full node

Closing as this is not a bug but rather a limitation that is now correctly being enforced.

@rjan90 rjan90 closed this as completed Jan 6, 2025
@github-project-automation github-project-automation bot moved this from 📌 Triage to 🎉 Done in FilOz Jan 6, 2025
@rjan90 rjan90 moved this from 🎉 Done to ☑️ Done (Archive) in FilOz Jan 6, 2025
@pk-controller
Copy link
Author

Can you recommend an RPC provider(s) that will let us push more than 10 messages?

@rvagg
Copy link
Member

rvagg commented Jan 7, 2025

@ArseniiPetrovich does glif allow a higher rate with an access key? Is there some special tier you can sell PL? Maybe it would be best if they had raw access to their own splitstore node that they can push through however messages they like.

@dumikau
Copy link
Contributor

dumikau commented Jan 7, 2025

@pk-controller Hey! My Slack handle in the Filecoin workspace is @Ales Dumikau - Protofire - Glif Nodes . Let's discuss the node you need in more detail there.

@ArseniiPetrovich
Copy link
Contributor

Sorry, folks, caught a serious flu and was not very responsive back in the days. @dumikau thanks for jumping in, and thanks @rvagg for pinging me.
I'm all good and back now, but it seems we haven't received any message from @pk-controller so far. Is this issue still relevant for you?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Kind: Bug
Projects
Status: ☑️ Done (Archive)
Development

No branches or pull requests

6 participants