中国综合性科技类核心期刊(北大核心)

中国科学引文数据库来源期刊(CSCD)

美国《化学文摘》(CA)收录

美国《数学评论》(MR)收录

俄罗斯《文摘杂志》收录

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

R语言在生物科学研究绘图中的应用

蓝洋 何秀 朱诚勖 张玉娟

蓝洋, 何秀, 朱诚勖, 张玉娟. R语言在生物科学研究绘图中的应用[J]. 华东师范大学学报(自然科学版), 2019, (1): 124-135, 143. doi: 10.3969/j.issn.1000-5641.2019.01.014
引用本文: 蓝洋, 何秀, 朱诚勖, 张玉娟. R语言在生物科学研究绘图中的应用[J]. 华东师范大学学报(自然科学版), 2019, (1): 124-135, 143. doi: 10.3969/j.issn.1000-5641.2019.01.014
LAN Yang, HE Xiu, ZHU Cheng-xu, ZHANG Yu-juan. Application of R language graphics in biological research[J]. Journal of East China Normal University (Natural Sciences), 2019, (1): 124-135, 143. doi: 10.3969/j.issn.1000-5641.2019.01.014
Citation: LAN Yang, HE Xiu, ZHU Cheng-xu, ZHANG Yu-juan. Application of R language graphics in biological research[J]. Journal of East China Normal University (Natural Sciences), 2019, (1): 124-135, 143. doi: 10.3969/j.issn.1000-5641.2019.01.014

R语言在生物科学研究绘图中的应用

doi: 10.3969/j.issn.1000-5641.2019.01.014
基金项目: 

国家自然科学基金 31871274

重庆市教育委员会科学技术研究项目 KJ1600304

重庆市科委基础研究与前沿探索项目 cstc2018jcyjA2487

详细信息
    作者简介:

    蓝洋, 女, 硕士研究生, 研究方向为生物信息.E-mail:908805060@qq.com

    通讯作者:

    张玉娟, 女, 教授, 硕士生导师, 研究方向为生物信息、化学计量组学、比较基因组学.E-mail:zhangyj@cqnu.edu.cn

  • 中图分类号: Q811.4

Application of R language graphics in biological research

  • 摘要: R语言具有强大的数据分析处理和可视化绘图功能,可以在Window、Linux以及Mac系统上使用,并且根据其编写新代码或调整已有代码可轻松实现科研中数据呈现与图形绘制的要求.然而其代码学习较为艰难、R-package使用复杂,所以未受到科研新手的青睐.基于生物科学相关领域的背景,整理汇集已出版的文献、公共数据库以及国家统计局中提供的数据,使用R语言、R编辑器RStudio并载入相关的R-package绘制出地图、热图、关联网络图、韦恩图和柱形图等高质量的图片,并提供相应的脚本与说明,以方便生物科研人员直接更改使用.本研究对生物科研中合理、直观地表述研究结果提供了良好的范例,并进行了详细讨论,且与其他绘图软件作了比较,以期R语言能够成为生物科学领域科研工作者入门学习、研究应用中绘制图片的首选工具.
  • 图  1  近10年33个省份西瓜平均产量

    Fig.  1  The average yield of watermelon crops across 33 provinces in the last 10 years

    图  2  生物胁迫RNA-seq在一天中3个时刻的表达量的变化

    Fig.  2  RNA-seq data showing subsets of genes for biotic stress at three instances per-day

    图  3  HSP 18相互作用蛋白质

    Fig.  3  Proteins interacting with HSP 18

    图  4  在不同的储存条件下黑腹果蝇的静止幼虫显著上调的mRNA转录物的数量

    Fig.  4  The number of significantly up-regulated mRNA transcripts in quiescent larvae of Drosophila melanogaster stored at different conditions compared to the start of the experiment

    图  5  在4周内DPPH自由基清除能力

    Fig.  5  Effect of DPPH radical scavenging activity over 4 weeks

    表  1  10种常用R-package信息介绍

    Tab.  1  Introduction to common R-packages

    序号 R-package 作用特点 开发时间 参考文献
    1 ggplot2 使用图形语法创建数据可视化 2013年10月 [29]
    2 lattice 复合图像组合图形 2008年 [30]
    3 VennDiagram 绘制Venn图 2011年1月 [31]
    4 Metacoder 群体数据分类并以树形式可视化显示 2017年2月 [32]
    5 treeman 高效和直观地构建系统发育树 2017年1月 [33]
    6 DGCA 差异基因分析 2016年11月 [34]
    7 vcfr 操作和可视化VCF数据 2017年1月 [35]
    8 sfinx 消除亲和纯化-质谱数据集中的假阳性 2017年2月 [36]
    9 snpGeneSets 基因组学研究注释 2016年12月 [37]
    10 Gogadget GO富集结果的分析和可视化 2016年12月 [38]
    下载: 导出CSV
  • [1] DANIELSSON J. An Introduction to R[M]. System Dynamics Modeling with R. New York:Springer International Publishing, 2016.
    [2] CARSON M A, BASILIKO N. Approaches to R education in Canadian universities[J]. F1000 research, 2016(5):1-18. http://d.old.wanfangdata.com.cn/OAPaper/oai_doaj-articles_3746541b53be4f61660408658ce118f9
    [3] SEIFERT E. OriginPro 9.1:Scientific data analysis and graphing software-Software review[J]. Journal of Chemical Information & Modeling, 2014, 54(5):1552. http://med.wanfangdata.com.cn/Paper/Detail/PeriodicalPaper_PM24702057
    [4] WASS J A. SigmaPlot 11:Now with total sigmaStat integration[J]. Scientific Computing, 2009, 26(1):21-25. https://www.questia.com/hbr-welcome
    [5] MCCORMICK K, SALCEDO J. SPSS statistics for data analysis and visualization[J]. Drug Testing & Analysis, 2017, 1(6):250-266.
    [6] GENTLEMAN R, HUBER W, CAREY V J. R Language[M]. New York:Springer Berlin Heidelberg, 2011.
    [7] GANCHEV D H. Using R language for statistical computing for pesticide application calculations[J]. MAYFEB Journal of Agricultural Science, 2016(1):10-26.
    [8] FELD C K, SEGURADO P, GUTIERREZ-CANOVAS C. Analysing the impact of multiple stressors in aquatic biomonitoring data:A 'cookbook' with applications in R[J]. Science of the Total Environment, 2016, 573:1320-1339. doi:  10.1016/j.scitotenv.2016.06.243
    [9] GUENZI D, FRATIANNI S, BORASO R, et al. CondMerg:An open source implementation in R language of conditional merging for weather radars and rain gauges observations[J]. Earth Science Informatics, 2016, 10(1):1-9. https://www.onacademic.com/detail/journal_1000039700528210_71a8.html
    [10] LI J, WANG J, CHEN Y, et al. A prognostic 4-gene expression signature for squamous cell lung carcinoma[J]. Journal of Cellular Physiology, 2017, 232(12):3702-3713. doi:  10.1002/jcp.v232.12
    [11] ZHAI J, HSU C H, DAYE Z J. Ridle for sparse regression with mandatory covariates with application to the genetic assessment of histologic grades of breast cancer[J]. Bmc Medical Research Methodology, 2017, 17(1):12-24. doi:  10.1186/s12874-017-0291-y
    [12] WANG Z, ZHANG C, LIU X, et al. Molecular and clinical characterization of PD-L1 expression at transcriptional level via 976 samples of brain glioma. Oncoimmunology[J], 2016, 5(11):e1196310. doi:  10.1080/2162402X.2016.1196310
    [13] OH D H, KIM I B, KIM S H, et al. Predicting autism spectrum disorder using blood-based gene expression signatures and machine learning[J]. Clin Psychopharmacol Neurosci, 2017, 15(1):47-52. https://www.onacademic.com/detail/journal_1000040168348810_dfee.html
    [14] WOJCIECHOWSKI J, HOPKINS A M, UPTON R N. Interactive pharmacometric applications using R and the shiny package[J]. CPT Pharmacometrics Syst Pharmacol, 2015, 4(3):e00021. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4394611/
    [15] LARRIBA Y, RUEDA C, FERNANDEZ M A, et al. Order restricted inference for oscillatory systems for detecting rhythmic signals.[J]. Nucleic Acids Research, 2016, 44(22):e163. doi:  10.1093/nar/gkw771
    [16] PARADIS E, GOSSELIN T, GRUNWALD N J, et al. Towards an integrated ecosystem of R packages for the analysis of population genetic data[J]. Molecular Ecology Resources, 2017, 17(1):1-4. doi:  10.1111/men.2017.17.issue-1
    [17] MILLER F P, VANDOME A F, MCBREWSTER J, et al. R (programming language)[J]. Betascript Publishing, 2010, 6(2):36-40. http://d.old.wanfangdata.com.cn/Periodical/jsjgcysj201310028
    [18] CHAPMAN C, FEIT E M D. An Overview of the R Language[M]//CHAPMAN C, FEIT E M D. R for Marketing Research and Analytics. New York: Springer International Publishing, 2015.
    [19] 王怀亮.箱须图在识别统计数据异常值中的作用及R语言实现[J].商业经济, 2011(5):64-65. http://d.old.wanfangdata.com.cn/Periodical/shangyyj201105029
    [20] 王怀亮.基于R语言的统计数据柱形图的实现[J].电子技术, 2013(8):78-80. http://d.old.wanfangdata.com.cn/Periodical/jzgc201808093
    [21] 石蕾. R语言在藓类形态与遗传变异研究中的应用[D].上海: 上海师范大学, 2015.
    [22] 兰国玉, 陈伟, 王继坤, 等. R语言在橡胶林动态监测地形图绘制方面的应用[J].热带农业科学, 2013, 33(3):50-53. doi:  10.3969/j.issn.1009-2196.2013.03.012
    [23] 纪相禹.基于R语言的差异表达基因检测研究[D].长春: 吉林大学. 2011
    [24] DOBRE G R. R Language:Statistical computing and graphics for modeling hydrologic time series[J]. Mathematical Modeling in Civil Engineering, 2014, 10(4):9-18.
    [25] CRAN. The Comprehensive R Archive Network[DB/OL].[2017-05-01].https://cran.r-project.org/.
    [26] HORNIK K. The comprehensive R archive network[J]. Wiley Interdisciplinary Reviews Computational Statistics, 2012, 4(4):394-398. doi:  10.1002/wics.1212
    [27] IHAKA R, GENTLEMAN R. R:A language for data analysis and graphics[J]. Journal of Computational & Graphical Statistics, 1996, 5(3):299-314. http://d.old.wanfangdata.com.cn/Periodical/xyjsclygc201001020
    [28] VANCE A. Data analysts captivated by R's power[N]. New York Times, 2009-01-06(Business Computing).
    [29] ITO K, MURPHY D. Application of ggplot2 to pharmacometric graphics[J]. Cpt Pharmacometrics & Systems Pharmacology, 2013, 2(10):1-16. https://www.ncbi.nlm.nih.gov/pubmed/24132163
    [30] SARKAR, DEEPAYAN. Lattice:Multivariate Data Visualization with R[M]. New York:Springer, 2008.
    [31] CHEN H, BOUTROS P C. VennDiagram:A package for the generation of highly-customizable Venn and Euler diagrams in R[J]. Bmc Bioinformatics, 2011, 12(1):35-41. http://d.old.wanfangdata.com.cn/OAPaper/oai_pubmedcentral.nih.gov_3041657
    [32] FOSTER Z S, SHARPTON T J, GRUNWALD N J. Metacoder:An R package for visualization and manipulation of community taxonomic diversity data[J]. Plos Computational Biology, 2017, 13(2):e1005404. doi:  10.1371/journal.pcbi.1005404
    [33] BENNETT D J, SUTTON M D, TURVEY S T. Treeman:An R package for efficient and intuitive manipulation of phylogenetic trees[J]. Bmc Research Notes, 2017, 10(1):30-39. doi:  10.1186/s13104-016-2340-8
    [34] MCKENZIE A T, KATSYV I, SONG W M, et al. DGCA:A comprehensive R package for differential gene correlation analysis[J]. Bmc Systems Biology, 2016, 10(1):106-130. http://d.old.wanfangdata.com.cn/NSTLQK/NSTL_QKJJ0225868648/
    [35] KNAUS B J, GRUNWALD N J. Vcfr:A package to manipulate and visualize variant call format data in r[J]. Molecular Ecology Resources, 2017, 17(1):44-53. doi:  10.1111/men.2017.17.issue-1
    [36] TITECA K, MEYSMAN P, LAUKENS K, et al. Sfinx:An R package for the elimination of false positives from affinity purification-mass spectrometry datasets.[J]. Bioinformatics, 2017, 33(12):1902-1904. doi:  10.1093/bioinformatics/btx076
    [37] MEI H, LI L, JIANG F, et al. snpGeneSets:An R package for genome-wide study annotation[J]. G3(Bethesda, Md.), 2016, 6(12):4087-4095. doi:  10.1534/g3.116.034694
    [38] NOTA B. Gogadget:An R package for interpretation and visualization of GO enrichment results[J]. Molecular Informatics, 2017, 36:5-6. https://www.ncbi.nlm.nih.gov/pubmed/28000438?dopt=Abstract
    [39] 中国测绘网.地图制图概念(词汇解释)[EB/OL]. (2013-08-28)[2017-05-15].http://www.cehui8.com/zhuanti/map/20130828/1191.html.
    [40] 房世波, 齐月, 韩国军, 等. 1961-2010年中国主要麦区冬春气象干旱趋势及其可能影响[J].中国农业科学, 2014, 47(9):1754-1763. doi:  10.3864/j.issn.0578-1752.2014.09.010
    [41] 国家统计局.国家数据[EB/OL].[2017-05-15]. http://data.stats.gov.cn/easyquery.htm?cn=E0103.
    [42] 丁香园.热图的解读与应用[EB/OL]. (2016-02-01)[2017-05-15]. http://www.dxy.cn/bbs/topic/32836042?source=rss.
    [43] CARUSO S, MIRAGLIA R, MARUZZELLI L, et al. Chaperone activity of tobacco HSP18, a small heat-shock protein, is inhibited by ATP[J]. Plant Journal, 2000, 23(6):703-713. doi:  10.1046/j.1365-313x.2000.00837.x
    [44] 张厚品.韦恩图的起源[J].数学教学, 2005(7):48-49. http://d.old.wanfangdata.com.cn/Periodical/xxqnjs201018024
    [45] MICALLEF L, RODGERS P. Euler APE:Drawing area-proportional 3-venn diagrams using ellipses[J]. Plos One, 2014, 9(7):e101717. doi:  10.1371/journal.pone.0101717
    [46] KOSTAL V, KORBELOVA J, STETINA T, et al. Physiological basis for low-temperature survival and storage of quiescent larvae of the fruit fly Drosophila melanogaster[J]. Scientific Reports, 2016(6):32346-32356.
    [47] 李红, 张元湖.应用DPPH?法测定苹果提取物的抗氧化能力[J].山东农业大学学报(自然科学版), 2005, 36(1):35-38. doi:  10.3969/j.issn.1000-2324.2005.01.007
    [48] 孙存普.自由基生物学导论[M].合肥:中国科学技术大学出版社, 1999.
    [49] JIN S K, HA S R, CHOI J S. Effect of Caesalpinia sappan L. extract on physico-chemical properties of emulsiontype pork sausage during cold storage[J]. Meat Science, 2015, 110:245-252. doi:  10.1016/j.meatsci.2015.08.003
    [50] MENTE S, KUHN M. The use of the R language for medicinal chemistry applications[J]. Current Topics in Medicinal Chemistry, 2012, 12(18):1957-1964. doi:  10.2174/156802612804910322
  • 加载中
图(5) / 表(1)
计量
  • 文章访问数:  157
  • HTML全文浏览量:  192
  • PDF下载量:  213
  • 被引次数: 0
出版历程
  • 收稿日期:  2017-12-06
  • 刊出日期:  2019-01-25

目录

    /

    返回文章
    返回