Clustering analysis of brain protein expression levels in trisomic and control mice

Carly L. Clayman, Scott N. Clayman, Partha Mukherjee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we describe a clustering analysis on 77 distinct brain protein expression levels of trisomic and control mice. Hierarchical clustering based on Euclidean distance results in clusters that partially coincide with experimental treatment groups of mice, as shown in dendrogram results. Normalization results in decreased within- and between-cluster sum of squares and a decreased ratio of between- to within-cluster sum of squares. The optimal number of clusters ranges from 1 to 4 clusters as determined by the gap statistic method or direct methods of the silhouette width or the elbow of total within-cluster sum of squares. Principal components analysis shows separation of clustered groups generated by k-means clustering. When clustered groups are plotted against the first two principal components, more distinct clusters are generated after z-score normalization of protein expression levels, compared to non-normalized results.

Original languageEnglish (US)
Title of host publicationProceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019
PublisherAssociation for Computing Machinery
Pages114-118
Number of pages5
ISBN (Electronic)9781450366359
DOIs
StatePublished - Apr 6 2019
Event3rd International Conference on Information System and Data Mining, ICISDM 2019 - Houston, United States
Duration: Apr 6 2019Apr 8 2019

Publication series

NameACM International Conference Proceeding Series

Conference

Conference3rd International Conference on Information System and Data Mining, ICISDM 2019
CountryUnited States
CityHouston
Period4/6/194/8/19

Fingerprint

Brain
Proteins
Principal component analysis
Statistics

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Cite this

Clayman, C. L., Clayman, S. N., & Mukherjee, P. (2019). Clustering analysis of brain protein expression levels in trisomic and control mice. In Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019 (pp. 114-118). (ACM International Conference Proceeding Series). Association for Computing Machinery. https://doi.org/10.1145/3325917.3325932
Clayman, Carly L. ; Clayman, Scott N. ; Mukherjee, Partha. / Clustering analysis of brain protein expression levels in trisomic and control mice. Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019. Association for Computing Machinery, 2019. pp. 114-118 (ACM International Conference Proceeding Series).
@inproceedings{6007cc5b6e704799aee64d665abf0997,
title = "Clustering analysis of brain protein expression levels in trisomic and control mice",
abstract = "In this paper, we describe a clustering analysis on 77 distinct brain protein expression levels of trisomic and control mice. Hierarchical clustering based on Euclidean distance results in clusters that partially coincide with experimental treatment groups of mice, as shown in dendrogram results. Normalization results in decreased within- and between-cluster sum of squares and a decreased ratio of between- to within-cluster sum of squares. The optimal number of clusters ranges from 1 to 4 clusters as determined by the gap statistic method or direct methods of the silhouette width or the elbow of total within-cluster sum of squares. Principal components analysis shows separation of clustered groups generated by k-means clustering. When clustered groups are plotted against the first two principal components, more distinct clusters are generated after z-score normalization of protein expression levels, compared to non-normalized results.",
author = "Clayman, {Carly L.} and Clayman, {Scott N.} and Partha Mukherjee",
year = "2019",
month = "4",
day = "6",
doi = "10.1145/3325917.3325932",
language = "English (US)",
series = "ACM International Conference Proceeding Series",
publisher = "Association for Computing Machinery",
pages = "114--118",
booktitle = "Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019",

}

Clayman, CL, Clayman, SN & Mukherjee, P 2019, Clustering analysis of brain protein expression levels in trisomic and control mice. in Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019. ACM International Conference Proceeding Series, Association for Computing Machinery, pp. 114-118, 3rd International Conference on Information System and Data Mining, ICISDM 2019, Houston, United States, 4/6/19. https://doi.org/10.1145/3325917.3325932

Clustering analysis of brain protein expression levels in trisomic and control mice. / Clayman, Carly L.; Clayman, Scott N.; Mukherjee, Partha.

Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019. Association for Computing Machinery, 2019. p. 114-118 (ACM International Conference Proceeding Series).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Clustering analysis of brain protein expression levels in trisomic and control mice

AU - Clayman, Carly L.

AU - Clayman, Scott N.

AU - Mukherjee, Partha

PY - 2019/4/6

Y1 - 2019/4/6

N2 - In this paper, we describe a clustering analysis on 77 distinct brain protein expression levels of trisomic and control mice. Hierarchical clustering based on Euclidean distance results in clusters that partially coincide with experimental treatment groups of mice, as shown in dendrogram results. Normalization results in decreased within- and between-cluster sum of squares and a decreased ratio of between- to within-cluster sum of squares. The optimal number of clusters ranges from 1 to 4 clusters as determined by the gap statistic method or direct methods of the silhouette width or the elbow of total within-cluster sum of squares. Principal components analysis shows separation of clustered groups generated by k-means clustering. When clustered groups are plotted against the first two principal components, more distinct clusters are generated after z-score normalization of protein expression levels, compared to non-normalized results.

AB - In this paper, we describe a clustering analysis on 77 distinct brain protein expression levels of trisomic and control mice. Hierarchical clustering based on Euclidean distance results in clusters that partially coincide with experimental treatment groups of mice, as shown in dendrogram results. Normalization results in decreased within- and between-cluster sum of squares and a decreased ratio of between- to within-cluster sum of squares. The optimal number of clusters ranges from 1 to 4 clusters as determined by the gap statistic method or direct methods of the silhouette width or the elbow of total within-cluster sum of squares. Principal components analysis shows separation of clustered groups generated by k-means clustering. When clustered groups are plotted against the first two principal components, more distinct clusters are generated after z-score normalization of protein expression levels, compared to non-normalized results.

UR - http://www.scopus.com/inward/record.url?scp=85068680101&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85068680101&partnerID=8YFLogxK

U2 - 10.1145/3325917.3325932

DO - 10.1145/3325917.3325932

M3 - Conference contribution

AN - SCOPUS:85068680101

T3 - ACM International Conference Proceeding Series

SP - 114

EP - 118

BT - Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019

PB - Association for Computing Machinery

ER -

Clayman CL, Clayman SN, Mukherjee P. Clustering analysis of brain protein expression levels in trisomic and control mice. In Proceedings of 3rd International Conference on Information System and Data Mining, ICISDM 2019. Association for Computing Machinery. 2019. p. 114-118. (ACM International Conference Proceeding Series). https://doi.org/10.1145/3325917.3325932