cosine_similarity(x,y)

I attempted to apply the method to clustering tweets.  I may be misunderstanding how this works, but running it with cosine_similarity(matrix name) only worked when my data was very small (500 tweets).  Once I went to 150,000 tweets, I received memory errors.  I used what the documentation said here, http://scikit-learn.org/stable/modules/generated/sklearn.metrics.pairwise.cosine_similarity.html, by adding cosine_similarity(matrix[len - 1], matrix) which I found in another example elsewhere since lost.

Is there a reason your code runs it without passing the x and y separately?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cosine_similarity(x,y) #11

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

cosine_similarity(x,y) #11

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions