This commit is contained in:
Leolee11111 2021-07-03 22:30:00 +08:00
commit b68b2aa87d
82 changed files with 37891 additions and 0 deletions

Binary file not shown.

After

Width:  |  Height:  |  Size: 15 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 110 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 21 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 117 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 12 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 58 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 70 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 53 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 205 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 66 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 15 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 158 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 60 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 115 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 14 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 49 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 60 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 53 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 193 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 48 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 66 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 294 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 82 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 34 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 99 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 19 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 93 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 34 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 137 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 107 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 35 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 32 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 72 KiB

File diff suppressed because one or more lines are too long

File diff suppressed because one or more lines are too long

Binary file not shown.

After

Width:  |  Height:  |  Size: 31 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 28 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 34 KiB

Binary file not shown.

Binary file not shown.

After

Width:  |  Height:  |  Size: 516 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 12 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 68 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 49 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 96 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 316 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 168 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 113 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 11 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 174 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 110 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 616 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 474 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 400 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 360 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 73 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 149 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 67 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 210 KiB

File diff suppressed because one or more lines are too long

View File

@ -0,0 +1,252 @@
booster[0]:
0:[f1<1.79999995] yes=1,no=2,missing=1
1:leaf=-0.00167664688
2:leaf=0.0225523021
booster[1]:
0:[f0<12.7450008] yes=1,no=2,missing=1
1:[f0<12.2350006] yes=3,no=4,missing=3
3:leaf=-0.00262214988
4:leaf=0.00287974672
2:leaf=0.0214265939
booster[2]:
0:leaf=-0.00332116825
booster[3]:
0:leaf=-0.00334951468
booster[4]:
0:leaf=-0.00329787261
booster[5]:
0:leaf=-0.00336898421
booster[6]:
0:leaf=-0.00335616455
booster[7]:
0:leaf=-0.00335616455
booster[8]:
0:leaf=-0.00334951468
booster[9]:
0:leaf=-0.00330584752
booster[10]:
0:[f0<12.7800007] yes=1,no=2,missing=1
1:leaf=0.0251812898
2:leaf=0.000917753612
booster[11]:
0:[f1<2.00500011] yes=1,no=2,missing=1
1:leaf=0.0227872226
2:leaf=-0.00228677341
booster[12]:
0:leaf=-0.00330345333
booster[13]:
0:leaf=-0.00332601252
booster[14]:
0:leaf=-0.0033333532
booster[15]:
0:leaf=-0.00340164825
booster[16]:
0:leaf=-0.00330339023
booster[17]:
0:leaf=-0.00338479271
booster[18]:
0:leaf=-0.00329532544
booster[19]:
0:leaf=-0.00332611194
booster[20]:
0:[f0<12.7800007] yes=1,no=2,missing=1
1:leaf=0.0232013054
2:leaf=0.00143345271
booster[21]:
0:[f0<12.4899998] yes=1,no=2,missing=1
1:[f0<12.1800003] yes=3,no=4,missing=3
3:leaf=-0.00249572913
4:leaf=0.00314772734
2:leaf=0.018272765
booster[22]:
0:leaf=-0.003382419
booster[23]:
0:leaf=-0.00339378673
booster[24]:
0:leaf=-0.00333784032
booster[25]:
0:leaf=-0.00329276407
booster[26]:
0:leaf=-0.00337041426
booster[27]:
0:leaf=-0.00333078601
booster[28]:
0:leaf=-0.00330856745
booster[29]:
0:leaf=-0.00332364067
booster[30]:
0:[f1<1.98500001] yes=1,no=2,missing=1
1:leaf=0.000478318485
2:leaf=0.024260221
booster[31]:
0:[f0<12.7450008] yes=1,no=2,missing=1
1:[f0<12.2350006] yes=3,no=4,missing=3
3:leaf=-0.00249379966
4:leaf=0.00320278108
2:leaf=0.0188698322
booster[32]:
0:leaf=-0.0033738818
booster[33]:
0:leaf=-0.00331355026
booster[34]:
0:leaf=-0.00333534204
booster[35]:
0:leaf=-0.00327354297
booster[36]:
0:leaf=-0.00339683075
booster[37]:
0:leaf=-0.00331358286
booster[38]:
0:leaf=-0.00331365853
booster[39]:
0:leaf=-0.0033137179
booster[40]:
0:[f1<2.21000004] yes=1,no=2,missing=1
1:leaf=-0.000693901035
2:leaf=0.0233493987
booster[41]:
0:[f0<12.7450008] yes=1,no=2,missing=1
1:[f0<12.2350006] yes=3,no=4,missing=3
3:leaf=-0.00244552107
4:leaf=0.00326615479
2:leaf=0.0188130029
booster[42]:
0:leaf=-0.00333267869
booster[43]:
0:leaf=-0.00331851956
booster[44]:
0:leaf=-0.00331858872
booster[45]:
0:leaf=-0.00335928169
booster[46]:
0:leaf=-0.00331845111
booster[47]:
0:leaf=-0.00332558574
booster[48]:
0:leaf=-0.0032710426
booster[49]:
0:leaf=-0.00336550688
booster[50]:
0:[f0<12.8050003] yes=1,no=2,missing=1
1:leaf=0.0211855415
2:leaf=-0.00144048361
booster[51]:
0:[f1<2.20499992] yes=1,no=2,missing=1
1:leaf=0.019391967
2:leaf=-0.00215333444
booster[52]:
0:leaf=-0.00335702649
booster[53]:
0:leaf=-0.00333733112
booster[54]:
0:leaf=-0.00336306286
booster[55]:
0:leaf=-0.00338677107
booster[56]:
0:leaf=-0.00335062738
booster[57]:
0:leaf=-0.0033568542
booster[58]:
0:leaf=-0.00335071259
booster[59]:
0:leaf=-0.00329332263
booster[60]:
0:[f0<12.7800007] yes=1,no=2,missing=1
1:leaf=0.02029467
2:leaf=0.00275001978
booster[61]:
0:[f0<12.1800003] yes=1,no=2,missing=1
1:leaf=-0.00244074967
2:[f0<12.7450008] yes=3,no=4,missing=3
3:leaf=0.00628815545
4:leaf=0.0183121879
booster[62]:
0:leaf=-0.00332106487
booster[63]:
0:leaf=-0.00331367063
booster[64]:
0:leaf=-0.00327475765
booster[65]:
0:leaf=-0.00332799996
booster[66]:
0:leaf=-0.00326598599
booster[67]:
0:leaf=-0.0033670105
booster[68]:
0:leaf=-0.00316595682
booster[69]:
0:leaf=-0.00332104694
booster[70]:
0:[f0<12.7650003] yes=1,no=2,missing=1
1:leaf=0.0203699265
2:leaf=0.00137320568
booster[71]:
0:[f0<12.4650002] yes=1,no=2,missing=1
1:leaf=-0.00194271619
2:leaf=0.0175629407
booster[72]:
0:leaf=-0.00331112696
booster[73]:
0:leaf=-0.00330391992
booster[74]:
0:leaf=-0.00335213495
booster[75]:
0:leaf=-0.00331101543
booster[76]:
0:leaf=-0.00331097725
booster[77]:
0:leaf=-0.003311123
booster[78]:
0:leaf=-0.0033186723
booster[79]:
0:leaf=-0.00326352566
booster[80]:
0:[f0<12.7450008] yes=1,no=2,missing=1
1:leaf=0.020401584
2:leaf=-0.000695777766
booster[81]:
0:[f1<2.08999991] yes=1,no=2,missing=1
1:leaf=0.0195696633
2:[f1<2.7750001] yes=3,no=4,missing=3
3:leaf=0.000353478739
4:leaf=-0.00255760807
booster[82]:
0:leaf=-0.00326074357
booster[83]:
0:leaf=-0.00334964832
booster[84]:
0:leaf=-0.00337391347
booster[85]:
0:leaf=-0.00336799235
booster[86]:
0:leaf=-0.0033680005
booster[87]:
0:leaf=-0.00334327272
booster[88]:
0:leaf=-0.00323429005
booster[89]:
0:leaf=-0.00324315438
booster[90]:
0:[f1<2.05499983] yes=1,no=2,missing=1
1:leaf=-0.00168179313
2:leaf=0.0211347807
booster[91]:
0:[f0<12.5249996] yes=1,no=2,missing=1
1:leaf=-0.000959251658
2:leaf=0.0150438426
booster[92]:
0:leaf=-0.00325799454
booster[93]:
0:leaf=-0.00334080867
booster[94]:
0:leaf=-0.00328319846
booster[95]:
0:leaf=-0.00332048861
booster[96]:
0:leaf=-0.00333417533
booster[97]:
0:leaf=-0.00329860556
booster[98]:
0:leaf=-0.00327543961
booster[99]:
0:leaf=-0.00334100495

Binary file not shown.

After

Width:  |  Height:  |  Size: 32 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 61 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 26 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 131 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 31 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 36 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 94 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 112 KiB

File diff suppressed because one or more lines are too long

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because one or more lines are too long

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because one or more lines are too long

92
readme.md Normal file
View File

@ -0,0 +1,92 @@
# 开源项目:机器学习集成学习与模型融合(基于python)
## (一) 项目初衷
这件事开始于我们参加一些机器学习比赛在类似于kaggle等大型数据科学竞赛的时候, 我们总喜欢去观摩高分选手的一些参赛模型得出一个很重要的结论就是除了深度学习以外的高分模型无一例外地见到了集成学习和模型融合的身影。这个发现迫使我去学习一些除了基础模型以外的集成学习方法以便在这些比赛上获得更好的成绩。但是当我使用具体的sklearn编程的时候, 往往因为不懂得集成学习的一 些底层知识而不懂参数的含义。因此,在本项目中我们会从基础模型的推导以及 sklearn应用过渡到使用集成学习的技术去优化我们的基础模型使得我们的模型能更好地解决机器学习问题。
## (二) 内容设置
- 第一章python数据处理基础待完善
- 第二章:机器学习基础
- 机器学习的三大主要任务
- 基本的回归模型
- 偏差与方差理论
- 回归模型的评估及超参数调优
- 基本的分类模型
- 分类问题的评估及超参数调优
- 第三章集成学习之投票法与Bagging
- 投票法的思路
- 投票法的原理分析
- 投票法的案例分析(基于sklearn介绍pipe管道的使用以及voting的使用)
- Bagging的思路
- Bagging的原理分析
- Bagging的案例分析(基于sklearn介绍随机森林的相关理论以及实例)
- 第四章集成学习之Boosting提升法
- Boosting的思路与Adaboost算法
- 前向分步算法与梯度提升决策树(GBDT)
- XGBoost算法与xgboost库的使用
- Xgboost算法案例与调参实例
- LightGBM算法的基本介绍
- 第五章集成学习之Blending与Stacking
- Blending集成学习算法
- Stacking集成学习算法
- Blending集成学习算法与Stacking集成学习算法的案例分享
- 第六章:集成学习之案例分析
- 集成学习案例一 (幸福感预测)
- 集成学习案例二 (蒸汽量预测)
## (三) 人员安排
| 成员 | 个人简介 | 个人主页 |
| ------ | ----------------------------------------------------- | ------------------------------------------------- |
| 李祖贤 | Datawhale成员项目负责人深圳大学数学与应用数学专业 | https://www.zhihu.com/people/meng-di-76-92/posts |
| 赵可 | Datawhale成员国家电网电气工程师 | |
| 杨毅远 | Datawhale成员清华大学自动化系研二 | https://yyysjz1997.github.io/ |
| 薛传雨 | Datawhale成员康涅狄格大学在读博士 | http://chuanyuxue.com/ |
| 陈琰钰 | Datawhale成员清华大学深圳研究生院研一 | https://cyy0214.github.io/ |
| 李嘉骐 | 清华大学自动化系在读博士 | https://www.zhihu.com/people/li-jia-qi-16-9/posts |
教程贡献情况:
李祖贤: CH2、CH4、CH5
薛传雨CH3
杨毅远CH6
李嘉骐CH3优化
组队学习贡献情况:
赵可长期学习流程设计、组织协调、23期运营&作业评审task1、24期运营&作业评审task3
李祖贤23期答疑&直播3次&作业评审(task3&4)、24期答疑&直播3次&作业评审(task4&5)
薛传雨23期运营&答疑&作业评审task5、24期运营
陈琰钰23期作业评审task2&6
杨毅远23期答疑
李嘉骐24期答疑&作业评审task1&2
(四) 课程编排与使用方法
- 课程编排:
课程现分为三个阶段,大致可以分为:机器学习模型回顾,集成学习方法的进阶, 集成学习项目的实践。
1. 第一部分:当我们可以比较熟练的操作数据并认识这个数据之后,我们需要开始数据清洗以及重
构, 将原始数据变为一个可用好用的数据, 基于sklearn构建模型以及模型评价在这个部分我们会重点详细学习各个基础模型的原理以及sklearn的各个参数。
2. 第二部分我们要使用sklearn, xgboost, lightgbm以及mIxtend库去学习集成学习的具体方法以及原理底层。
3. 第三单元:通过前面的理论学习,现在可以开始进行实践了,这里有两个大型集成学习项目的实践。
- 使用方法:
我们的代码都是jupyter notebook和markdown形式, 我们在每一章后面会给出几道小习题方便大家掌握。其中的内容会作为组队学习的项目安排!
## (五) 反馈
- 如果有任何想法可以联系邮箱 (1028851587@qq.com)
- 如果有任何想法可以联系我们DataWhale