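Chinese title: | 从统计角度阐述数据科学 (Explaining Data Science from a Statistical Point of View) |
Name: | |
Confidentiality level: | Public |
Thesis language: | Chinese |
Discipline code: | 070101 |
Discipline/major: | |
Student type: | Bachelor's |
Degree: | Bachelor of Science |
Degree year: | 2018 |
University: | Beijing Normal University (北京师范大学) |
Campus: | |
School: | |
First supervisor name: | |
First supervisor affiliation: | |
Submission date: | 2018-05-10 |
Defense date: | 2018-05-09 |
Foreign-language title: | Explaining data science from a statistical point of view |
Chinese keywords: | |
Chinese abstract: |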
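This thesis studies the relationship between statistics and data science, and how data science can be explained from a statistical perspective. It points out an existing misconception: many people currently hold a certain bias against statistics, their understanding of it still rests on its original meaning of counting and census-taking, and they consider statistics theoretical and unchanging. From the perspective of historical development, however, statisticians are found to have taken part in advancing data science and even to have played a leading role. Based on the formula data science = SDCCC = SDC^3, the thesis proposes that statistical thinking and methods are the core of data science, while other elements such as computer technology and domain knowledge are auxiliary tools for solving problems. It goes on to describe the advantages and challenges facing statistics in the big data era: statisticians should examine statistics and data science critically and must not be blindly confident. To meet the wave of data science successfully, they should emphasize teamwork, cultivate the capacity for “data wisdom”, strengthen computing skills, and broaden domain knowledge.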
|
Foreign-language abstract: |
This thesis examines the relationship between statistics and data science and how data science can be explained from a statistical point of view. It first identifies a common misconception: many people remain prejudiced against statistics, understanding it only in its original sense of counting and census-taking, and regard it as theoretical and unchanging. The historical record shows, however, that statisticians have helped drive the development of data science and have even played a leading role in it. Drawing on the formula “data science = SDCCC = SDC^3”, the thesis argues that statistical thinking and methods are the core of data science, while other components, such as computer technology and domain knowledge, are tools that assist in solving problems. It then discusses the advantages and challenges that statistics faces in the era of big data: statisticians must examine statistics and data science critically and must not be blindly self-confident. To respond successfully to the wave of data science, they should value teamwork, cultivate “data wisdom”, strengthen their computing skills, and broaden their domain knowledge.
|
Total number of references: | 17 |
Call number: | 本070101/18118 |
Open-access date: | 2019-07-09 |