Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split examples for clustered analysis #464

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

bertilhatt
Copy link
Contributor

  • The User/Session split is a very general use case, not just for social media companies
  • Social media also use this feature for clusters of friends, so if you introduce video calls to some, your friends can video-call, too.

* The User/Session split is a very general use case, not just for social media companies
* Social media also use this feature for clusters of friends, so if you introduce video calls to some, your friends can video-call, too.
Copy link

netlify bot commented Sep 12, 2024

Deploy Preview for eppo-data-docs ready!

Name Link
🔨 Latest commit 75fbb16
🔍 Latest deploy log https://app.netlify.com/sites/eppo-data-docs/deploys/66e2a38292718c0008fefcdf
😎 Deploy Preview https://deploy-preview-464--eppo-data-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@@ -4,7 +4,8 @@ Clustered analysis is available in cases where there is a need to compute metric

Common cases include but are not limited to:
- B2B companies that randomize at the company level, but wants to look at user level metrics
- Social media companies that randomize at the user level (for a consistent user experience), but want to look at session-level metrics
- Marketplaces and Social media companies who want to cluster users more likely to interact with each to share common feature
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this suggesting randomizing on those clusters of users? I worry that in many cases there won't be enough clusters to satisfy the CLT. My understanding is that the current approach to clusters on works if you have at least a few hundred cluster samples

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, this sentence could be clearer:

.. who want to cluster users more likely to interact with each to share common feature

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants