Skip to main content
Filter by
Sorted by
Tagged with
4 votes
2 answers
158 views

I am doing some time series clustering, and would like to align the dendrogram with the time series shapes. This is almost there: library(ggplot2) library(reshape2) library(stats) library(patchwork) ...
Neal Fultz's user avatar
  • 9,779
3 votes
3 answers
189 views

For my current project in C#, I am tasked with fetching customer details from a data source, 'cleansing' said customers (making sure the name is capitalised correctly, mobile formatted correctly, etc.)...
chickentaco's user avatar
2 votes
0 answers
96 views

I am grouping my data using the ward.D2 hierarchical clustering method in R. I need to calculate the distance and similarity for each step, from 2 to 20 clusters. Similarity is calculated using the ...
pnlp's user avatar
  • 35
0 votes
0 answers
90 views

I am doing a WGCNA analysis, and I am facing a lot of struggles. I used a signed network during my bwnet analysis (TOMType = "signed"). I thought this meant that only positive correlations ...
Laura's user avatar
  • 1
-1 votes
1 answer
140 views

I have images of graph lines with trends, and I want to cluster similar trends together. However, after trying several clustering algorithms, they are not working as well as I expected. I believe that ...
SpicyCurry's user avatar
0 votes
0 answers
58 views

I performed HDBSCAN Clustering hdbscan_clusterer = hdbscan.HDBSCAN(min_cluster_size=200) df['Cluster'] = hdbscan_clusterer.fit_predict(data_matrix_for_clustering) Now, I’m interested in getting the ...
name0's user avatar
  • 1
0 votes
0 answers
124 views

I am clustering a dataset with both numerical and categorical variables. To handle the high dimensionality, I performed dimensionality reduction separately for both types of inputs, retaining 21 ...
Guillon's user avatar
0 votes
1 answer
55 views

I implemented Vehicle Ad-hoc Network in NS3.38 and used default Node class in NS3. also I created Cluster Object that has some attributes. in some conditions I need to remove node from cluster and ...
Negar Javadzadeh's user avatar
0 votes
1 answer
138 views

I'm working on a clustering problem and I would like to use the hclust functions to create the dendrogram and cutreeDynamic to create clusters from the mentioned dendrogram. In fact, I have already ...
José Adrián Pardo Pérez's user avatar
0 votes
1 answer
177 views

I have computed the dissimilarity matrix using vegdist() function, and method specified as "morisita". However, even though hclust() function is built to read both distance or dissimilarity ...
Sukhraj Kaur 1910115's user avatar
1 vote
1 answer
99 views

I am trying to create a dendogram in R. As of now, I have used the factoextra package, and specifically the fviz_dend function. The code is as follows: num_data_scaled <- scale(num_data) res.hc &...
AlisAquilae's user avatar
1 vote
1 answer
53 views

I was trying to totate x-axis value (45 degree) in dendogram using fviz_dend function from factoextra package, but nothing works. I also tried to follow the answer in this post rotating dendogram x ...
Faisal Mustafa's user avatar
1 vote
1 answer
171 views

I have a large matrix (approximately 35,000 x 35,000) and I'm preparing a distance object in R for hierarchical clustering. The base R function dist() is too slow, so I'm using the distances function ...
titi's user avatar
  • 619
0 votes
0 answers
42 views

enter image description hereThis is a portion of my project where it yields the following graphic. I noticed how the correlation between EWH and MCHI (which is usually move in parallel to each other) ...
maeta cherdsatirakul's user avatar
1 vote
1 answer
602 views

I use the scikit-learn Agglomerative Clustering python library in my code to automatically cluster points and place a new, larger point in the center of the cluster. I have a set of several thousand ...
nxxn's user avatar
  • 13
1 vote
0 answers
102 views

In my Python code, I have a set of objects that I want to cluster based on a given distance matrix. However, there are some objects that should never end up in the same cluster. The number of clusters ...
Alex's user avatar
  • 11
0 votes
1 answer
41 views

I am a beginner with R, but I have been analysing a large data set of GPS data, made up of unique individuals (name) (approx 100 unique names) with 1,000,000+ lines of data. Each unique name has ...
Jackijones's user avatar
0 votes
1 answer
182 views

I am interested in plotting the tree represented by the output of hierarchy.to_tree(). To clarify my question, I give the following MWE: import numpy as np from scipy.cluster import hierarchy from ...
super_ask's user avatar
0 votes
1 answer
102 views

I am trying to configure opensips keepalived for nodes handling in cluster. I followed the instrustions mentioned in this guide. https://controlpanel.opensips.org/htmldoc_9_X_X/keepalived.html. After ...
Aqib Ali's user avatar
1 vote
0 answers
33 views

So I'm working on a project where I am using embeddings generated form Universal Sentence Encoder and giving them as input to kmeans clustering present in sklearn.cluster. The problem is that I ran ...
Shashank Shandilya's user avatar
1 vote
1 answer
477 views

I am running this relatively straightforward algorithm. if I understand the algorithm correctly if you cluster to, say, 8 clusters, you should had the results for all clusters above 8, right? Would ...
Reuben's user avatar
  • 113
1 vote
1 answer
278 views

Using species abundance data recorded for multiple samples, I want to create a dendrogram where branches represent the similarity of samples. The distance measure should be Bray-Curtis-similarity. For ...
empetrum's user avatar
  • 125
0 votes
1 answer
319 views

I have NGS data (Unique clones only) and I want to group them based on the similarity (clustering is preferable) using Python language. Please have a look into the below sample sequences. Also to ...
shivam's user avatar
  • 617
1 vote
1 answer
100 views

I have a dataset of short-term behaviors displayed by 30 individuals. #Load packages library(TraMineR) # Function to generate a random non-numerical sequence generate_random_sequence <- function(...
JatNTU's user avatar
  • 21
0 votes
2 answers
720 views

I am working in Python. I am using a binary dataframe in which I have a ser of values of 0 and 1 for diferent users at diferent times. I can perform hierarchical clustering directly from the dataframe ...
slow_learner's user avatar
0 votes
0 answers
89 views

I have this: And I want to have 25 clusters. So I used this: from sklearn.cluster import AgglomerativeClustering hc = AgglomerativeClustering(n_clusters = 25, affinity = 'euclidean', linkage = 'ward')...
Fatemeh Nazari's user avatar
1 vote
0 answers
76 views

I have created multiple correlational matrices for age ranges (e.g. 1-5, 5-10, 10-15 years old). However, when I do hierarchical clustering using ggcorrplot for example, running ggcorrplot(...
Erika's user avatar
  • 21
1 vote
1 answer
608 views

After doing k-means classification on a dataset (value of k = 3), I tried to find out the total entropy of all the clusters. (Total number of datapoints, or, the total length of the dataset was : 500) ...
SSaha13's user avatar
  • 11
1 vote
0 answers
69 views

I am trying to use the color_branches function from dendextend package to color the dendrogram branches of my heatmap create with complexheatmap. I bumped into a strange behaviour, as you can see in ...
Romina Appierdo's user avatar
0 votes
0 answers
83 views

Given a linkage matrix Z the resulting heatmap I get from g = sns.clustermap(corr) is different from the heatmap I get using sns.heatmap(corr[np.ix_(g.dendrogram_row.reordered_ind, g.dendrogram_row....
athantas's user avatar
1 vote
0 answers
88 views

I have a dataset that comprises several spatial path trajectories of variable lengths (i.e. a time series of X & Y coordinates). I am looking to group these based on the similarities of their ...
rb446611's user avatar
0 votes
1 answer
196 views

I am trying to convert a dendrogram into a graph/tree to perform calculations with its nodes, leaves and subtrees and find a miximal subgraphs, but I have not found a function/package that helps me in ...
gennifer montaño rodriguez's user avatar
1 vote
0 answers
68 views

I have been trying to find Dunns index using the Armadillo library for a larger algorithm I'm working on. Whenever I run the code, I get an output Dunns index:-nan(ind) and an error saying I'm out of ...
Duplic8e's user avatar
1 vote
0 answers
95 views

Running the below code (initially without par (lwd = 1) line ) returned the following error, and the same error was returned again after setting par(lwd = 1) or to 0.5 or 10. Truthfully don't totally ...
ghoeltzel's user avatar
2 votes
1 answer
134 views

I have a tibble with an id column, a G grouping variable, and 300 numeric variables. I want a method that clusters the raws to the point that each row is matched/paired in a cluster with another ...
GiulioGCantone's user avatar
0 votes
1 answer
282 views

Question 1 I got a Warning UserWarning: HDBSCAN can only compute 3 clusters. Setting n_clusters to 3... when I specified param n_clusters=4, using HDBSCAN_flat(). Can I get the max_eom_clusters ...
Dancingmeow's user avatar
0 votes
0 answers
90 views

I have built a cluster plot with fviz_cluster. shown in attached image. fviz_cluster(hcpc, palette = c("blue", "red", "black"), ellipse.type = "convex&...
Ikram Bashir's user avatar
0 votes
1 answer
164 views

Suppose that I had a dataset where I've done a cluster analysis with, say, k=9. Perhaps this has been from a k-means or I've just done a complete linkage hierarchical agglomeration or I eyeballed it ...
Vonvorv's user avatar
  • 31
1 vote
0 answers
33 views

My initial data is : data_init = pd.read_csv('data_merged.csv') Total periode to cover 25 months initial_period_data = data_init[(data_init['order_purchase_timestamp'] >= earliest_timestamp) & (...
Naina's user avatar
  • 11
1 vote
0 answers
23 views

In Hierarchical Clustering, what are the distance measurement methods used? Are different measurement methods used depending on the purpose? If performing Hierarchical Clustering for Region Proposal, ...
rem_maji_tenshi_'s user avatar
1 vote
0 answers
115 views

I am doing data grouping in python. I was able to group the set seamlessly using the K-modes algorithm and got 4 groups. Now it tries to do the same using the ROCK algorithm to compare the results. ...
Qba515's user avatar
  • 11
0 votes
0 answers
56 views

I have a dataset of 90000 rows and 200 columns. I trying to form clusters. I have reduced the size using pca. When try it in python I get MemoryError. I understand the problem is because of the no of ...
Vishweshwaran Sridhar's user avatar
1 vote
0 answers
169 views

I have two dendrograms resulting from hierarchical clustering and I want to visually compare them using a tanglegram. However, I only want to display a certain number of clusters that are cut at a ...
Ivan Z's user avatar
  • 1,622
2 votes
0 answers
301 views

I have a dataset in which I am measuring area of between 10-40 microglia for some 25ish subjects, each measured in 3 different slices of tissue. I want to do hierarchical clustering to ask if the area ...
Hannah Harder's user avatar
0 votes
0 answers
87 views

I have succeeded in changing the color of leaf labels in my dendrogram according to its classification, but I want to follow this coloring upwards. from scipy.cluster import hierarchy import ...
user22037973's user avatar
0 votes
2 answers
61 views

I have data like this: Which can also be seen as this PHP array: $items = [ ['item1' => 'a', 'item2' => 'c', 'item3' => 'h'], ['item1' => 'a', 'item2' => 'c', 'item3' => 'i']...
nunomira's user avatar
  • 398
1 vote
0 answers
189 views

I have this dendrogram: hc <- hclust(dist_s, method = 'average') At first, I was showing it with dendextend: dend = as.dendrogram(hc) par(mar = c(3, 2, 2, 8)) dend %>% set("labels_cex&...
Vitória Stavis's user avatar
-1 votes
3 answers
150 views

I want to extract a hierarchical structure from a table in the sql server database. Table look similar to this with levels going till lvl 10: lvl1 lvl2 lvl3 I want to extract a hierarchical structure ...
pnvm's user avatar
  • 1
1 vote
0 answers
623 views

I am trying to write a hierarchical agglomerative clustering algorithm from scratch without using most of the related libraries, and my program does successfully put the test set into the closest ...
resstudent's user avatar
0 votes
0 answers
305 views

I would like to use scipy.cluster.hierarchy to perform neighbor joining on a distance matrix. However, I have been unable to locate in the documentation that this is an available option. The reason ...
Jamie's user avatar
  • 792

1
2 3 4 5
24