[Bioinformatics Skills] Gene Subtyping (Cluster): Cluster Analysis with the ConsensusClusterPlus Package

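# Clustering code
# Assumes `data` (numeric matrix: features in rows, samples in columns)
# and `workDir` (output directory path) are already defined.
library(ConsensusClusterPlus)

maxK=9    # largest number of clusters to evaluate
results=ConsensusClusterPlus(data,
              maxK=maxK,
              reps=50,                # number of subsamples
              pItem=0.8,              # proportion of samples drawn per subsample
              pFeature=1,             # use all features in every subsample
              title=workDir,          # output directory for the plots
              clusterAlg="pam",       # partitioning around medoids
              distance="euclidean",
              seed=123456,            # fixed seed for reproducible results
              plot="png")             # write the plots as PNG files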
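
The call above relies on `data` and `workDir`, which the post does not define. The following is a minimal sketch of one way to set them up and to read the cluster assignments back out of the returned list. The toy matrix, the sample and gene names, the temporary output directory, and the smaller `maxK` are all assumptions made for illustration. `plot="pdf"` is used here only because it does not depend on a PNG graphics device being available.

```r
# Hypothetical end-to-end sketch; needs the Bioconductor package ConsensusClusterPlus.
library(ConsensusClusterPlus)

set.seed(1)
# Toy expression matrix (assumption): 100 features in rows, 30 samples in columns.
data <- matrix(rnorm(100 * 30), nrow = 100,
               dimnames = list(paste0("gene", 1:100), paste0("sample", 1:30)))
# Shift the last 15 samples so that two groups exist in the data.
data[, 16:30] <- data[, 16:30] + 3

# Assumed output directory; the package creates it because plot is not NULL.
workDir <- file.path(tempdir(), "consensus_demo")

results <- ConsensusClusterPlus(data,
                                maxK = 4,
                                reps = 50,
                                pItem = 0.8,
                                pFeature = 1,
                                title = workDir,
                                clusterAlg = "pam",
                                distance = "euclidean",
                                seed = 123456,
                                plot = "pdf")

# results[[k]] holds the solution for k clusters (k = 2, ..., maxK).
clusters_k2 <- results[[2]][["consensusClass"]]    # cluster label per sample
consensus_k2 <- results[[2]][["consensusMatrix"]]  # samples x samples consensus values
print(table(clusters_k2))
```

Which value of k to keep is a judgment call that is usually made from the consensus CDF and delta-area plots written to the output directory. This sketch only shows where the per-k results live.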

Arguments: description of each parameter

d

data to be clustered: either a data matrix whose columns are items/samples and whose rows are features (for example, a gene expression matrix with genes in rows and microarrays in columns), an ExpressionSet object, or a distance object (only when features are not resampled).

maxK

integer value. maximum cluster number to evaluate.

reps

integer value. number of subsamples.

pItem

numerical value. proportion of items to sample.

pFeature

numerical value. proportion of features to sample.

clusterAlg

character value. cluster algorithm. 'hc' for hierarchical (hclust), 'pam' for partitioning around medoids, 'km' for k-means upon data matrix, or a function that returns a clustering. See example and vignette for more details.

title

character value for output directory. Directory is created only if plot is not NULL or writeTable is TRUE. This title can be an absolute or relative path.

innerLinkage

hierarchical linkage method for subsampling.

finalLinkage

hierarchical linkage method for consensus matrix.

distance

character value. 'pearson' (1 - Pearson correlation), 'spearman' (1 - Spearman correlation), 'euclidean', 'binary', 'maximum', 'canberra', 'minkowski', or a custom distance function.

ml

optional. prior result, if supplied then only do graphics and tables.

tmyPal

optional character vector of colors for consensus matrix

seed

optional numerical value. sets random seed for reproducible results.

plot

character value. NULL: print to screen; 'pdf'; 'png'; or 'pngBMP' for bitmap png, helpful for large datasets.

writeTable

logical value. TRUE: write output and log to csv.

weightsItem

optional numerical vector. weights to be used for sampling items.

weightsFeature

optional numerical vector. weights to be used for sampling features.

res

result of ConsensusClusterPlus (the input to calcICL).

verbose

boolean. If TRUE, print messages to the screen to indicate progress. This is useful for large datasets.

corUse

optional character value. specifies how to handle missing data in correlation distances: 'everything', 'pairwise.complete.obs', or 'complete.obs'. See cor() for a description.
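
To make `reps`, `pItem`, and the consensus matrix more concrete, here is a minimal base R sketch of the resampling idea behind these arguments. It is an illustration only, not the package's internal code: the function name `consensus_matrix` and the toy data are hypothetical, and it swaps in average-linkage `hclust` on Euclidean distance for the clustering step so that no extra package is needed.

```r
# Illustrative consensus matrix in base R (not the ConsensusClusterPlus implementation).
consensus_matrix <- function(d, k, reps = 50, pItem = 0.8, seed = 123456) {
  set.seed(seed)
  n <- ncol(d)
  together <- matrix(0, n, n)  # how often a pair fell into the same cluster
  sampled  <- matrix(0, n, n)  # how often a pair was drawn in the same subsample
  for (r in seq_len(reps)) {
    # Draw a proportion pItem of the items (columns) without replacement.
    idx <- sort(sample(n, size = floor(pItem * n)))
    # Cluster only the subsampled items and cut the tree into k groups.
    hc <- hclust(dist(t(d[, idx])), method = "average")
    cl <- cutree(hc, k = k)
    sampled[idx, idx]  <- sampled[idx, idx] + 1
    together[idx, idx] <- together[idx, idx] + outer(cl, cl, "==")
  }
  # Consensus value = co-clustered count / co-sampled count;
  # pmax avoids dividing by zero for pairs never drawn together.
  cm <- together / pmax(sampled, 1)
  dimnames(cm) <- list(colnames(d), colnames(d))
  cm
}

# Toy data (assumption): 50 features x 20 samples, two shifted groups of 10.
set.seed(42)
toy <- matrix(rnorm(50 * 20), nrow = 50,
              dimnames = list(NULL, paste0("s", 1:20)))
toy[, 11:20] <- toy[, 11:20] + 3

cm <- consensus_matrix(toy, k = 2)
print(round(cm[c(1, 2, 11, 12), c(1, 2, 11, 12)], 2))
```

Values near 1 mean two samples were almost always clustered together whenever both were drawn, and values near 0 mean they almost never were. A larger `reps` gives more stable estimates at the cost of run time.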

Source: Cluster analysis with the ConsensusClusterPlus package

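Study reflections

As a researcher, I am always racing against time. I must study diligently and master the knowledge and skills of modern research. In the future, I hope to stay true to my original aspiration and keep my mission in mind. I will seize every day and make the most of my youth, working to become an excellent researcher, putting the Scientific Outlook on Development into practice, studying it conscientiously, and doing my own job well.

Science and technology are the driving force behind the development of human society. Only by cultivating "doing research" as an interest, and by combining practice with scientific thinking to innovate, can we truly advance science and technology. How should research be done? Discover problems in practice; draw on your store of knowledge to formulate them; rely on advanced techniques to study them; and, staying true to the original aspiration, apply the final results to solving problems in practice.

The path of reading and learning seems boundless, with no clear direction. Yet with diligence, the mountain of books has its stone path and the sea of learning has its far shore. Without accumulating small steps, one cannot travel a thousand li; without gathering small streams, one cannot form rivers and seas. As a researcher, I should strive and study hard; only then can I advance further in research!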
