<!DOCTYPE html>
<html lang="en" class="no-js">
<head>
<meta charset="UTF-8" />
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<title>Large Language Models: From Theory to Practice</title>
<meta name="viewport" content="width=device-width, initial-scale=1, maximum-scale=1" />
<meta name="description" content="Large Language Models: From Theory to Practice" />
<meta name="keywords" content="Natural language Processing, NLP, Deep Learning, Neural Networks" />
<meta name="author" content="lmpixels" />
<link rel="shortcut icon" href="favicon.ico">
<link rel="stylesheet" href="css/bootstrap.min.css" type="text/css">
<link rel="stylesheet" href="css/animate.css" type="text/css">
<link rel="stylesheet" href="css/animations.css" type="text/css">
<link rel="stylesheet" href="css/owl.carousel.css" type="text/css">
<link rel="stylesheet" href="css/magnific-popup.css" type="text/css">
<link rel="stylesheet" href="css/main.css" type="text/css">
<script src="js/modernizr.custom.js"></script>
</head>
<body>
<!-- Loading animation -->
<div class="preloader">
<div class="preloader-animation">
<div class="preloader-spinner">
</div>
</div>
</div>
<!-- /Loading animation -->
<div id="page" class="page one-page-style">
<!-- Header -->
<header id="site_header" class="header mobile-menu-hide">
<div class="header-content clearfix">
<div class="my-photo">
<img src="images/logo.png" alt="image">
</div>
<div class="site-title-block">
<div class="site-title">Large Language Models: From Theory to Practice</div>
</div>
<!-- Navigation -->
<div class="site-nav">
<!-- Main menu -->
<ul id="nav" class="site-main-menu">
<li>
<a class="pt-trigger" href="#home">About the Book</a><!-- href value = data-id without # of .pt-page. -->
</li>
<li>
<a class="pt-trigger" href="#chapter">Chapters</a>
</li>
<li>
<a class="pt-trigger" href="#cite">Citation</a>
</li>
<li>
<a class="pt-trigger" href="#contact">Feedback</a>
</li>
</ul>
<!-- /Main menu -->
</div>
<!-- Navigation -->
<!-- Social Links -->
<div class="social-links">
<a href="https://github.com/intro-llm/intro-llm.github.io" target="_blank"><i class="fa-github"></i></a>
</div>
<!-- / Social Links -->
<!-- Copyrights -->
<div class="copyrights">Fudan University NLP Group © 2023 All rights reserved.</div>
<!-- / Copyrights -->
</div>
</header>
<!-- /Header -->
<!-- Mobile Header -->
<div class="mobile-header mobile-visible">
<div class="mobile-logo-container">
<div class="mobile-header-image">
<a href="#">
<img src="images/logo.jpg" alt="image">
</a>
</div>
<div class="mobile-site-title"><a href="#">Large Language Models: From Theory to Practice</a></div>
</div>
<a class="menu-toggle mobile-visible">
<i class="fa fa-bars"></i>
</a>
</div>
<!-- /Mobile Header -->
<!-- Arrows Nav -->
<div class="lmpixels-scroll-to-top"><i class="lnr lnr-chevron-up"></i></div>
<!-- /Arrows Nav -->
<!-- Main Content -->
<div id="main" class="site-main">
<!-- Page changer wrapper -->
<div class="pt-wrapper">
<!-- Subpages -->
<div class="subpages">
<!-- Home Subpage -->
<section id="home" class="pt-page pt-page-home ">
<div class="section-inner custom-page-content">
<div class="section-title-block second-style">
<h2 class="section-title">About the Book</h2>
</div>
<div class="section-content">
<div class="row bs-30">
<div class="col-xs-12 col-sm-12">
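A large language model (LLM) is a language model built from a deep neural network with tens of billions of parameters or more, trained on large amounts of unlabeled text using self-supervised learning. Since 2018, companies and research institutions including Google, OpenAI, Meta, Baidu, and Huawei have released a variety of models, such as BERT and GPT, that perform well on almost all natural language processing tasks. Large models began to grow explosively in 2021, and the release of ChatGPT in November 2022 in particular drew widespread attention around the world. Users can interact with these systems in natural language to carry out tasks ranging from understanding to generation, including question answering, classification, summarization, translation, and chat. Large language models have shown a strong grasp of world knowledge and a strong understanding of language. This book introduces the foundational theory of large language models, including language models, distributed model training, and reinforcement learning, and uses the DeepSpeed-Chat framework as an example to show how large language models and ChatGPT-like systems are built in practice.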
</div>
</div>
<!-- Services Block -->
<div class="row">
<div class="col-xs-12 col-sm-3" align="center">
<div class="author-photo">
<img src="images/zhangqi.png" alt="image">
</div>
<div align="center">
<h4>Qi Zhang (张奇)</h4>
<p>Professor, School of Computer Science and Technology, Fudan University</p>
</div>
</div>
<div class="col-xs-12 col-sm-3" align="center">
<div class="author-photo">
<img src="images/guitao.jpg" alt="image">
</div>
<div align="center">
<h4>Tao Gui (桂韬)</h4>
<p>Young Associate Researcher, Institute of Computational Linguistics, Fudan University</p>
</div>
</div>
<div class="col-xs-12 col-sm-3" align="center">
<div class="author-photo">
<img src="images/zhengrui.jpg" alt="image">
</div>
<div align="center">
<h4>Rui Zheng (郑锐)</h4>
<p>Ph.D. student, School of Computer Science and Technology, Fudan University</p>
</div>
</div>
<div class="col-xs-12 col-sm-3" align="center">
<div class="author-photo">
<img src="images/huangxuanjing.jpg" alt="image">
</div>
<div align="center">
<h4>Xuanjing Huang (黄萱菁)</h4>
<p>Professor, School of Computer Science and Technology, Fudan University</p>
</div>
</div>
</div>
</div>
</div>
</section>
<!-- End of Home Subpage -->
<!-- Chapter Subpage -->
<section class="pt-page" id="chapter">
<div class="section-inner custom-page-content">
<div class="section-title-block second-style">
<h2 class="section-title">Chapters</h2>
</div>
<h3 class="section-title">Download the current version: <a href="./chapter/LLM-TAP.pdf">Full version (PDF)</a> | <a href="https://pan.baidu.com/s/1smGQ5ECzDIpvZladuCE59g?pwd=jyz6">Full version (Baidu Netdisk)</a></h3>
<div class="row bs-10">
<div class="col-xs-9 col-sm-9">
<h3 class="section-title">Slides:</h3>
<ul>
<li><a href="./slides/ch1.pptx">Chapter 1: Introduction</a></li>
<li><a href="./slides/ch2.pptx">Chapter 2: Foundations of Large Language Models</a></li>
<li><a href="./slides/ch3.pptx">Chapter 3: Pre-training Data for Large Language Models</a></li>
<li><a href="./slides/ch4.pptx">Chapter 4: Distributed Model Training</a></li>
<li><a href="./slides/ch5.pptx">Chapter 5: Supervised Fine-Tuning</a></li>
<li><a href="./slides/ch6.pptx">Chapter 6: Reinforcement Learning</a></li>
<li><a href="./slides/ch7.pptx">Chapter 7: Applications of Large Language Models</a></li>
<li><a href="./slides/ch8.pptx">Chapter 8: Evaluation of Large Language Models</a></li>
</ul>
</div>
</div>
</div>
</section>
<!-- Cite Subpage -->
<section class="pt-page" id="cite">
<div class="section-inner custom-page-content">
<div class="section-title-block second-style">
<h2 class="section-title">Citation</h2>
</div>
<div class="row bs-10">
<div class="col-xs-6 col-sm-6">
<div class="alert alert-dark" role="alert">
Qi Zhang, Tao Gui, Rui Zheng, Xuanjing Huang. 大规模语言模型:从理论到实践 (Large Language Models: From Theory to Practice). https://intro-llm.github.io/, 2023.
</div>
<div class="alert alert-dark" role="alert">
@book{zhang2023introllm, <br>
title = {大规模语言模型:从理论到实践},<br>
publisher = {},<br>
year = {2023},<br>
author = {Zhang, Qi and Gui, Tao and Zheng, Rui and Huang, Xuanjing},<br>
address = {Shanghai},<br>
isbn = {},<br>
url = {https://intro-llm.github.io/},<br>
}
</div>
</div>
</div>
</div>
</section>
<!-- Contact Subpage -->
<section class="pt-page" id="contact">
<div class="section-inner custom-page-content">
<div class="section-title-block second-style">
<h2 class="section-title">Feedback</h2>
</div>
<div class="row bs-10">
<div class="col-xs-9 col-sm-9">
<p>If you have any comments or suggestions, please submit them through the GitHub <a href="https://github.com/intro-llm/intro-llm.github.io/issues" target="_blank">Issues</a> page.</p>
<p>Feedback may include, but is not limited to:</p>
<ul>
<li>Typos</li>
<li>Incorrect descriptions</li>
<li>Incorrect definitions</li>
<li>Suggestions</li>
</ul>
</div>
<div class="col-xs-12 col-sm-3">
<div class="col-inner bs-30">
<div class="lm-info-block gray-default">
<i class="lnr lnr-envelope"></i>
<h4>[email protected]</h4>
<span class="lm-info-block-value"></span>
<span class="lm-info-block-text"></span>
</div>
</div>
</div>
</div>
</div>
</section>
<!-- End Contact Subpage -->
</div>
</div>
<!-- /Page changer wrapper -->
</div>
<!-- /Main Content -->
</div>
<script src="js/jquery-2.1.3.min.js"></script>
<script src="js/bootstrap.min.js"></script>
<script src="js/imagesloaded.pkgd.min.js"></script>
<script src="js/jquery.malihu.PageScroll2id.min.js"></script>
<script src="js/validator.js"></script>
<script src="js/jquery.shuffle.min.js"></script>
<script src="js/masonry.pkgd.min.js"></script>
<script src="js/owl.carousel.min.js"></script>
<script src="js/jquery.magnific-popup.min.js"></script>
<!--<script src='https://www.google.com/recaptcha/api.js'></script>-->
<!--<script src="https://maps.googleapis.com/maps/api/js?key=YOUR-API-KEY"></script>-->
<!--<script src="https://maps.googleapis.com/maps/api/js"></script>-->
<!--<script src="js/jquery.googlemap.js"></script>-->
<script src="js/main.js"></script>
</body>
</html>