-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Backfill teams - CE #4925
base: master
Are you sure you want to change the base?
Backfill teams - CE #4925
Conversation
5b7fd98
to
13d751f
Compare
site: s, | ||
inviter: inv | ||
} | ||
# preload: [inviter: inv, site: s] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🔥
@ruslandoga the belief here is, running this (with |
👋 Could you please summarise the changes / possible problems here? I don't know what to be looking out for :) |
Application.get_env(:plausible, Plausible.Repo)[:url] | ||
) | ||
|
||
@repo.start(db_url, pool_size: 2 * @max_concurrency) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can this be lowered? Some free-tier / entry level hosted PostgreSQL instances limit the pool size to 25. So the "normal" repo and this data migration repo would compete for available connections and flood the logs with non-actionable errors.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, this is adapted from what we've been running on prod.
|
||
orphaned_teams = | ||
from( | ||
t in Plausible.Teams.Team, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since CE can run this migration at any point in the future, would it be safer to hard code table names (here and in every other Ecto query) at the moment of the next CE release?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we have already had a similar problem with sites.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because it's compiled within lib/
, I think we'll get warnings sooner?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think there would be any warnings.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we have already had a similar problem with sites.
Sorry, what problem exactly?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I cannot find it but basically there was a similar query that was using a schema and no select
and it was failing in CE because some column in the Ecto schema didn't yet exist in the database because the migration for that column was being run afterwards. I think you were the one who fixed it by adding an explicit select:
:)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The problem with these is that they only warn / raise at runtime.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Found it: #4459 (comment)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if Elixir 1.18 would surface this at compile time 🤔
backfill(dry_run?) | ||
end | ||
|
||
defp backfill(dry_run?) do |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would it benefit from being a single transaction?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise it seems like if it fails half-way, it would always fail in the future due to insert conflicts.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The only reason for multiple transactions is to speed up execution time for many sites. Any specific insert conflicts you're predicting? We've been re-running it successfully. It will only catch up with records that require attention on subsequent executions. Setting up the pool size to one should make it possible to wrap in a single transaction, even with parallel Task
s spawned I think.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Any of the repo calls can raise (db pool timeout, no connection, etc.) and stop the execution thus leaving the DB in a potentially inconsistent state, where some records have already been inserted and inserting them again would cause a conflict. I don't know the specifics of this migration so I can't immediately provide examples where exactly that can happen. But simply wrapping it in a single transaction is easy and safe.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This backfill procedure was specifically designed in a away where it can be resumed at any point. Operation where consistency needs to be ensured across multiple db operations are wrapped up in a transaction. We can of course wrap it all up in a single transaction as well, if you prefer.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, let's wrap it in a single transaction and remove async operations.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If it would be easier, I can make the changes I commented about and then re-request review from the core team to make sure the intended migration logic is preserved.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ruslandoga oh my, just sat down to this PR but then saw your comment. Very much appreciated.
:pass | ||
end) | ||
|> Enum.with_index() | ||
|> Task.async_stream( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does it have to be async?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Only if we want to utilize the pool, shouldn't matter for small CE instances. I think we can remove any parallel processing then.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we can remove any parallel processing then.
Let's do this, I think it would be easier to understand and debug if it goes wrong for one of the self-hosters.
|> Ecto.Changeset.put_change(:updated_at, guest_invitation.updated_at) | ||
|> @repo.update!() | ||
|
||
if rem(idx, 1000) == 0 do |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It can probably be lowered for CE. Or just print each site that's being updated right now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Absolutely 👍
Uku said it might still be too early to include Teams into CE, even after this PR. And we should instead wait until there are new features using the new Teams schemas. I'll keep on cherry-picking for the next release for now. I think this can stay open until the next full CE release. |
Great, thanks. Roughly, when will that be? |
Gah I'm sorry for not providing much context. |
No description provided.