-
Notifications
You must be signed in to change notification settings - Fork 336
Open
Description
I attempted to apply the method to clustering tweets. I may be misunderstanding how this works, but running it with cosine_similarity(matrix name) only worked when my data was very small (500 tweets). Once I went to 150,000 tweets, I received memory errors. I used what the documentation said here, http://scikit-learn.org/stable/modules/generated/sklearn.metrics.pairwise.cosine_similarity.html, by adding cosine_similarity(matrix[len - 1], matrix) which I found in another example elsewhere since lost.
Is there a reason your code runs it without passing the x and y separately?
Metadata
Metadata
Assignees
Labels
No labels