K-Median



Relevant Links:
Emily
Research Home
My Journal
On Another Note
Clustering Methods and Analyses:

KNN Clustering

K-Median Clustering

Kaplan-Meier Estimation

Top Genes based on Variance

Cross-Validation

Significant Genes

Possible Cluster Causes



Back to K-Median results




The Harvard Dataset provided statistics on the age, gender, smoking habits, and percentage of the maximum tumor size on the patients within its study. Using these, the average and variance were taken over the members of each group. The results are as follows:
Cluster SizeTestGroup 1Group 2Group 3Group 4
AverageVarianceAverageVarianceAverageVarianceAverageVariance
K=2smoking43.407691246.882201250.73----
gender (m=0 f=1)0.5846150.2466350.5263160.263158----
age60.5119.852465124.2515----
max tumor %6080073.42105169.5906----
K=3smoking43.251350.06453.71277.59642.906251077.776--
gender(m=0 f=1)0.50.260870.5555560.2614380.6190480.24158--
age62.5952490.5394963104.588262.625194.2446--
max tumor %72.02381306.170273.05556176.879164.58333315.0362--
K=4smoking43.843751342.49141.428571190.1152.447061327.42347.35609.1125
gender (m=0 f=1)0.6041670.2442380.50.2692310.5882350.2573530.40.3
age61.85417119.914563.2142988.3351663.64706103.117666354.5
max_tumor %69.6875347.240765276.923173.82353176.65447680







Questions or Comments?
Email Me! Emily K. Mower