Replies: 1 comment
-
I have the same situation. Would love to hear some feedback.
-
Hi,
I have a very large dataset, several hundred million rows. Does it make sense to subsample the data (take a million rows), train a model on the sample, and then apply it to the whole dataset? Or is there a better way? I tried running blocking on the whole dataset; two days later it still had not finished and I got kicked off. Any ideas would be greatly appreciated!
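To make the subsampling step concrete, here is a minimal sketch of drawing a uniform random sample from a dataset too large to hold in memory. It assumes the rows can be streamed one at a time from any iterable (a file, a database cursor); the function name `reservoir_sample` and its parameters are illustrative and not part of any library.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(rows: Iterable[T], k: int, seed: Optional[int] = None) -> List[T]:
    """Return up to k rows chosen uniformly at random from a stream.

    Uses reservoir sampling (Algorithm R): one pass over the data,
    memory proportional to k, no need to know the total row count.
    """
    rng = random.Random(seed)
    sample: List[T] = []
    for i, row in enumerate(rows):
        if i < k:
            # Fill the reservoir with the first k rows.
            sample.append(row)
        else:
            # Keep the new row with probability k / (i + 1),
            # evicting a uniformly chosen current member.
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                sample[j] = row
    return sample


if __name__ == "__main__":
    # Stand-in for a very large table: any iterable of rows works.
    subset = reservoir_sample(range(10_000_000), k=1_000, seed=42)
    print(len(subset))
```

This only covers drawing the sample. Whether a model trained on such a sample transfers well to the full dataset is still the open question here.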
Sincerely,
tom