2024 年 4 月

如何使用 FastAPI？

下面的步骤将帮助你开始使用 FastAPI 构建一个简单的 Web 应用。

1.安装 FastAPI：在命令行中运行以下命令安装 fastapi，并安装 uvicorn ：

pip install fastapi

2. 创建 FastAPI 应用：在你喜欢的 IDE 编辑器中创建一个新的 Python 文件，例如fastapi-demo.py。然后将以下代码复制到文件中：

from fastapi import FastAPI

app = FastAPI()

@app.get("/")
def read_root():
    return {"Hello": "World"}

上面的代码创建了一个名为 app 的 FastAPI 实例，并定义了一个根路由，返回一个简单的 JSON 响应。

3. 运行 FastAPI 应用：在命令行中运行以下命令启动 FastAPI 应用：

uvicorn main:app --reload

FastAPI 将在本地启动一个服务器，并监听默认端口（8000）。你可以在浏览器中访问 http://127.0.0.1:8000，看到 {"Hello": "World"} 的响应。

4. 添加更多的路由和功能：你可以继续在应用中添加更多的路由和功能，根据自己的需求进行扩展：

@app.get("/items/{item_id}")
def read_item(item_id: int, q: str = None):
    return {"item_id": item_id, "q": q}

上面的代码添加了一个名为 read_item 的新路由，它接受一个 item_id 参数和一个可选的 q 参数，并返回一个 JSON 响应。

实践案例

以下是一个使用 FastAPI 构建的实践案例，能够在 IDE 编辑器中直接运行的代码。

from fastapi import FastAPI

app = FastAPI()

@app.get("/")
def read_root():
    return {"Hello": "World"}

@app.get("/items/{item_id}")
def read_item(item_id: int, q: str = None):
    return {"item_id": item_id, "q": f"接口id：{item_id}"}

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="127.0.0.1", port=8000)

复制以上代码到你的 IDE 编辑器中，并运行它。然后通过浏览器访问 http://localhost:8000/，你将看到 {"Hello": "World"} 的响应。同样地，你也可以访问 http://localhost:8000/items/42?q=somequery，查看带有参数的响应。

轻量快速的 Python ASGI 框架 uvicorn

什么是 Uvicorn ？

答：Uvicorn 是基于 uvloop 和 httptools 构建的非常快速的 ASGI 服务器。

什么是 uvloop 和 httptools ？

答： uvloop 用于替换标准库 asyncio 中的事件循环，使用 Cython 实现，它非常快，可以使 asyncio 的速度提高 2-4 倍。asyncio 不用我介绍吧，写异步代码离不开它。

httptools 是 nodejs HTTP 解析器的 Python 实现。

什么是 ASGI 服务器？

答：异步网关协议接口，一个介于网络协议服务和 Python 应用之间的标准接口，能够处理多种通用的协议类型，包括 HTTP，HTTP2 和 WebSocket。

请简单介绍下 Uvicorn

答：目前，Python 仍缺乏异步的网关协议接口，ASGI 的出现填补了这一空白，现在开始，我们能够使用共同的标准为所有的异步框架来实现一些工具，ASGI 帮助 Python 在 Web 框架上和 Node.JS 及 Golang 相竟争，目标是获得高性能的 IO 密集型任务，ASGI 支持 HTTP2 和 WebSockets，WSGI 是不支持的。

Uvicorn 目前支持 HTTP1.1 和 WebSocket，计划支持 HTTP2。

使用方法：

$ pip install uvicorn

创建一个文件 example.py

async def app(scope, receive, send):
    assert scope['type'] == 'http'
    await send({
        'type': 'http.response.start',
        'status': 200,
        'headers': [
            [b'content-type', b'text/plain'],
        ]
    })
    await send({
        'type': 'http.response.body',
        'body': b'Hello, world!',
    })

启动 Uvicorn

$ uvicorn example:app

你也可以不使用命令行，直接运行你的脚本也是可以的，如下：

import uvicorn

async def app(scope, receive, send):
    ...

if __name__ == "__main__":
    uvicorn.run("example:app", host="127.0.0.1", port=5000, log_level="info")

FastAPI使用uvicorn

import uvicorn
from fastapi import FastAPI
 
app = FastAPI()
 
@app.get("/")
async def root():
    return {"message": "Hello World"}
 
if __name__ == '__main__':
    uvicorn.run(app=app)

深入到`uvicorn.run()`方法里面，看到一个：

def run(app, **kwargs):
    config = Config(app, **kwargs)
    server = Server(config=config)
 
    if (config.reload or config.workers > 1) and not isinstance(app, str):
        logger = logging.getLogger("uvicorn.error")
        logger.warn(
            "You must pass the application as an import string to enable 'reload' or 'workers'."
        )
        sys.exit(1)
 
    if config.should_reload:
        sock = config.bind_socket()
        supervisor = StatReload(config, target=server.run, sockets=[sock])
        supervisor.run()
    elif config.workers > 1:
        sock = config.bind_socket()
        supervisor = Multiprocess(config, target=server.run, sockets=[sock])
        supervisor.run()
    else:
        server.run()

再深入到 config = Config(app, **kwargs)里面，就看到一些很多的相关的配置信息项：

class Config:
    def __init__(
        self,
        app,
        host="127.0.0.1",
        port=8000,
        uds=None,
        fd=None,
        loop="auto",
        http="auto",
        ws="auto",
        lifespan="auto",
        env_file=None,
        log_config=LOGGING_CONFIG,
        log_level=None,
        access_log=True,
        use_colors=None,
        interface="auto",
        debug=False,
        reload=False,
        reload_dirs=None,
        workers=None,
        proxy_headers=True,
        forwarded_allow_ips=None,
        root_path="",
        limit_concurrency=None,
        limit_max_requests=None,
        backlog=2048,
        timeout_keep_alive=5,
        timeout_notify=30,
        callback_notify=None,
        ssl_keyfile=None,
        ssl_certfile=None,
        ssl_version=SSL_PROTOCOL_VERSION,
        ssl_cert_reqs=ssl.CERT_NONE,
        ssl_ca_certs=None,
        ssl_ciphers="TLSv1",
        headers=None,
    ):
....

所以还可以添加的参数可以看上面的几个配置的选项的信息来填：

于是乎还可以修改为：

uvicorn.run(app=app, host="127.0.0.1", port=8000, reload=True, debug=True)

发现本来想热更新代码，结果呐？有告警信息提示：

WARNING:  You must pass the application as an import string to enable 'reload' or 'workers'.

翻译过来就是说：警告：必须将应用程序作为导入字符串传递，才能启用“重新加载” 然后呢：我修改为：

  uvicorn.run(app='app', host="127.0.0.1", port=8000, reload=True, debug=True)

又提示：

ERROR:    Error loading ASGI app. Import string "app" must be in format "<module>:<attribute>".

好吧，我再看看官方文档说是：

在命令行下是需要：模块加app名称：刚好上面的错误提示也是说需要:

    uvicorn.run(app='main:app', host="127.0.0.1", port=8000, reload=True, debug=True)

这样之后就可以启动热更新重启服务了！

使用命令行时，你可以使用 uvicorn –help 来获取帮助。

Usage: uvicorn [OPTIONS] APP

Options:
  --host TEXT                     Bind socket to this host.  [default:
                                  127.0.0.1]
  --port INTEGER                  Bind socket to this port.  [default: 8000]
  --uds TEXT                      Bind to a UNIX domain socket.
  --fd INTEGER                    Bind to socket from this file descriptor.
  --reload                        Enable auto-reload.
  --reload-dir TEXT               Set reload directories explicitly, instead
                                  of using the current working directory.
  --workers INTEGER               Number of worker processes. Defaults to the
                                  $WEB_CONCURRENCY environment variable if
                                  available. Not valid with --reload.
  --loop [auto|asyncio|uvloop|iocp]
                                  Event loop implementation.  [default: auto]
  --http [auto|h11|httptools]     HTTP protocol implementation.  [default:
                                  auto]
  --ws [auto|none|websockets|wsproto]
                                  WebSocket protocol implementation.
                                  [default: auto]
  --lifespan [auto|on|off]        Lifespan implementation.  [default: auto]
  --interface [auto|asgi3|asgi2|wsgi]
                                  Select ASGI3, ASGI2, or WSGI as the
                                  application interface.  [default: auto]
  --env-file PATH                 Environment configuration file.
  --log-config PATH               Logging configuration file.
  --log-level [critical|error|warning|info|debug|trace]
                                  Log level. [default: info]
  --access-log / --no-access-log  Enable/Disable access log.
  --use-colors / --no-use-colors  Enable/Disable colorized logging.
  --proxy-headers / --no-proxy-headers
                                  Enable/Disable X-Forwarded-Proto,
                                  X-Forwarded-For, X-Forwarded-Port to
                                  populate remote address info.
  --forwarded-allow-ips TEXT      Comma separated list of IPs to trust with
                                  proxy headers. Defaults to the
                                  $FORWARDED_ALLOW_IPS environment variable if
                                  available, or '127.0.0.1'.
  --root-path TEXT                Set the ASGI 'root_path' for applications
                                  submounted below a given URL path.
  --limit-concurrency INTEGER     Maximum number of concurrent connections or
                                  tasks to allow, before issuing HTTP 503
                                  responses.
  --backlog INTEGER               Maximum number of connections to hold in
                                  backlog
  --limit-max-requests INTEGER    Maximum number of requests to service before
                                  terminating the process.
  --timeout-keep-alive INTEGER    Close Keep-Alive connections if no new data
                                  is received within this timeout.  [default:
                                  5]
  --ssl-keyfile TEXT              SSL key file
  --ssl-certfile TEXT             SSL certificate file
  --ssl-version INTEGER           SSL version to use (see stdlib ssl module's)
                                  [default: 2]
  --ssl-cert-reqs INTEGER         Whether client certificate is required (see
                                  stdlib ssl module's)  [default: 0]
  --ssl-ca-certs TEXT             CA certificates file
  --ssl-ciphers TEXT              Ciphers to use (see stdlib ssl module's)
                                  [default: TLSv1]
  --header TEXT                   Specify custom default HTTP response headers
                                  as a Name:Value pair
  --help                          Show this message and exit.

使用进程管理器

使用进程管理器确保你以弹性方式运行运行多个进程，你可以执行服务器升级而不会丢弃客户端的请求。

一个进程管理器将会处理套接字设置，启动多个服务器进程，监控进程活动，监听进程重启、关闭等信号。

Uvicorn 提供一个轻量级的方法来运行多个工作进程，比如 --workers 4，但并没有提供进行的监控。

使用 Gunicorn

Gunicorn 是成熟的，功能齐全的服务器，Uvicorn 内部包含有 Guicorn 的 workers 类，允许你运行 ASGI 应用程序，这些 workers 继承了所有 Uvicorn 高性能的特点，并且给你使用 Guicorn 来进行进程管理。

这样的话，你可能动态增加或减少进程数量，平滑地重启工作进程，或者升级服务器而无需停机。

在生产环境中，Guicorn 大概是最简单的方式来管理 Uvicorn 了，生产环境部署我们推荐使用 Guicorn 和 Uvicorn 的 worker 类：

gunicorn example:app -w 4 -k uvicorn.workers.UvicornWorker

执行上述命令将开户 4 个工作进程，其中 UvicornWorker 的实现使用 uvloop 和httptools 实现。在 PyPy 下运行，你可以使用纯 Python 实现，可以通过使用UvicornH11Worker 类来做到这一点。

gunicorn -w 4 -k uvicorn.workers.UvicornH11Worker

Gunicorn 为 Uvicorn 提供了不同的配置选项集，但是一些配置暂不支持，如–limit-concurrency 。

使用 Supervisor

要supervisor用作流程管理器，您应该：

使用其文件描述符将套接字移交给uvicorn，supervisor始终将其用作0，并且必须在本fcgi-program节中进行设置。

或为每个uvicorn进程使用UNIX域套接字。

一个简单的主管配置可能看起来像这样： administratord.conf：

[supervisord]

[fcgi-program:uvicorn]
socket=tcp://localhost:8000
command=venv/bin/uvicorn --fd 0 example:App
numprocs=4
process_name=uvicorn-%(process_num)d
stdout_logfile=/dev/stdout
stdout_logfile_maxbytes=0

然后运行`supervisord -n`。

使用 Circus

要circus用作流程管理器，您应该：

使用其文件描述符将套接字移交给uvicorn，马戏团可将其用作$(circus.sockets.web)。
或为每个uvicorn进程使用UNIX域套接字。

使用 Circus 与 Supervisor 很类似。配置文件 circus.ini 如下：

[watcher:web]
cmd = venv/bin/uvicorn --fd $(circus.sockets.web) example:App
use_sockets = True
numprocesses = 4

[socket:web]
host = 0.0.0.0
port = 8000

然后运行circusd circus.ini。

与 Nginx 部署

Nginx 作为 Uvicorn 进程的代理并不是必须的，你可以使用 Nginx 做为负载均衡。推荐使用 Nginx 时配置请求头，如 X-Forwarded-For,X-Forwarded-Proto，以便 Uvicorn 识别出真正的客户端信息，如 IP 地址，scheme 等。这里有一个配置文件的样例：

http {
  server {
    listen 80;
    client_max_body_size 4G;

    server_name example.com;

    location / {
      proxy_set_header Host $http_host;
      proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
      proxy_set_header X-Forwarded-Proto $scheme;
      proxy_redirect off;
      proxy_buffering off;
      proxy_pass http://uvicorn;
    }

    location /static {
      # path for static files
      root /path/to/app/static;
    }
  }

  upstream uvicorn {
    server unix:/tmp/uvicorn.sock;
  }

}

使用 HTTPS

要使用https运行uvicorn，需要证书和私钥。推荐的获取方法是使用Let’s Encrypt。

对于使用https进行本地开发，可以使用mkcert 生成有效的证书和私钥。

$ uvicorn example:app --port 5000 --ssl-keyfile=./key.pem --ssl-certfile=./cert.pem

使用 Gunicorn 也可以直接使用证书。

也可以与uvicorn的工人一起使用证书来获取gunicorn

$ gunicorn --keyfile=./key.pem --certfile=./cert.pem -k uvicorn.workers.UvicornWorker example:ap

大语言模型借助Embedding将人类的语言 “编码”成自己的语言，然后通过注意力Attention从中提取各种丰富的知识和结构，加权积累与关联生成自己的语言，然后“编码”回人类的语言

半导体行业的新闻、研报-资源汇总-自媒体参考4大咨询公司等的研报

半导体行业的新闻、研报啥的，发现能找到的资料都比较凌乱，于是自己整理了一份资源汇总，分享给大家，希望能方便大家找各种资讯。汇总表还比较简单，欢迎留下自己的意见以及继续补充。

序号	机构	类别	地址	备注
1	SEMI	行业协会	https://www.semi.org/en	专业，但是需要企业会员
2	SIA 美国半导体行业协会	行业协会	https://www.semiconductors.org/	主要关注美国市场
3	WSTS 全球半导体贸易统计协会	行业协会	https://www.wsts.org/	比较权威
4	TSMC 台积电	半导体科技	https://investor.tsmc.com/english/annual-reports	全球龙头，看年报季报
5	Samsung 三星	半导体科技	https://www.samsung.com/global/ir/financial-information/audited-financial-statements/	韩国龙头，看年报季报
6	Intel 因特尔	半导体科技	https://www.intc.com/filings-reports/annual-reports	美国龙头，看年报季报
7	SMIC 中芯	半导体科技	https://www.smics.com/site/company_financialSummary	大陆龙头，看年报季报
8	Gartner 高德纳	咨询	https://www.gartner.com/en/newsroom	不定期更新
9	Mckinsey 麦肯锡	咨询	https://www.mckinsey.com/industries/semiconductors/our-insights	不定期更新
10	PwC 普华永道	咨询	https://www.pwc.com/gx/en/industries/tmt.html	不定期更新
11	Accenture 埃森哲	咨询	https://www.accenture.com/us-en/services/high-tech/semiconductors	对TMT都比较专业
12	Goldman Sachs 高盛	金融财务	https://www.goldmansachs.com/insights/index.html	公司研究的报告不免费
13	KPMG 毕马威	金融财务	https://home.kpmg/xx/en/home/insights.html	有较多行业数据
14	Seeking Alpha	媒体	https://seekingalpha.com/	投资者的讨论比较多
15	Bloomberg 彭博	媒体	https://www.bloomberg.com/	行业新闻及指数
16	Google Finance 谷歌财经	媒体	https://www.google.com/finance/	行业新闻及指数
17	Reuters 路透社	媒体	https://www.reuters.com/news/archive/rbssSemiconductors	行业新闻有专栏
18	Statista	媒体	https://www.statista.com/topics/1182/semiconductors/	行业指数
19	萝卜投研	媒体	https://robo.datayes.com/	国内研报
20	半导体行业观测	媒体	http://semiinsights.com/	中文新闻，一手报道比较少

深入研究 Transformer 模型时，将反复提到 architecture 和 checkpoints 以及 model。这些术语的含义略有不同：architecture：这是模型的结构——每个层的定义以及模型中发生的每个操作。checkpoints：这些是将在给定架构中加载的权重。model：这是一个总括术语，不像 “architecture” 或 “checkpoints” 那么精确。它可以同时表示两者。例如，BERT是一种 architecture，而BERT-base-cased（谷歌团队 BERT 的第一个版本训练的一组权重）是一个 checkpoints。然而，人们可以说 “BERT model” 和“BERT-base-cased model”

召回率

召回率是指在所有正样本中，被成功预测为正样本的样本数占比。在机器学习领域，召回率是评估模型预测性能的重要指标之一。在本文中，我们将从多个方面深入探讨召回率的概念和应用。
阈值越高，精准率越高，召回率越低；阈值越低，精准率越低，召回率越高。
精准率（precision）的实际含义是，在所有预测为正例的样本中，真实为正例的比例。

召回率（recall）的实际含义是，在所有真实为正例的样本中，被正确预测为正例的比例。

举个例子，假设有一个垃圾邮件分类器，它要从100封邮件中识别出垃圾邮件。假设其中有20封是真正的垃圾邮件，80封是正常邮件。如果分类器预测出了15封垃圾邮件，其中10封是真的垃圾邮件，5封是误判的正常邮件，那么它的召回率和精准率分别是：

精准率 = 10 / 15 = 0.67，表示分类器预测出的垃圾邮件中有三分之二是真的垃圾邮件。

召回率 = 10 / 20 = 0.5，表示分类器只能找到一半的真正垃圾邮件。

一、什么是召回率
召回率是指模型成功检测到的所有真实正样本个数与真实正样本总数的比例。直观来讲，它衡量了模型在识别正样本方面的准确性。召回率数值越高，模型对正样本的识别越精准，但也可能带来更高的误报率。当真实正样本比较少时，召回率也会相对较低。

二、如何计算召回率
召回率的计算公式如下：

召回率 = 模型成功检测到的所有真实正样本个数 / 真实正样本总数
举个例子，假设某个模型对500个正样本进行了预测，其中正确识别出了450个，但漏识别了50个。这个模型的召回率为450/500=0.9，即90%。

三、什么情况下需要召回率
召回率适用于以下场景：

二分类问题，其中正样本数相对较少。
模型的预测结果需要被进一步修正或筛选。
需要优化模型的敏感性，即尽可能地识别更多的真实正样本。
四、如何提高召回率
提高召回率的方法有以下几种：

增加正样本数。增加正样本数能够提高召回率，但前提是正样本确实存在，并且可以扩大正样本范围。
调整模型阈值。降低模型阈值能够减少漏识别，但也会增加误报。调整模型阈值需要根据具体场景进行。
优化模型训练。在模型训练时，可以多采用正样本，增加正样本的特征和数量，有效提高模型的召回率。
采用不同的算法和模型。不同的算法和模型具有不同的优劣特点，在选择算法和模型时需要充分考虑召回率的需求。
五、召回率与精确率的关系
召回率和精确率都是模型评估中比较重要的指标。召回率注重的是尽可能识别更多的真实正样本，而精确率则注重的是尽可能排除误识别的负样本。一般而言，召回率与精确率存在一定的负相关关系，提高其中一个指标可能会降低另一个指标。在实际应用中，需要根据具体场景进行取舍。

六、总结
本文对召回率进行了深入的探讨，并从多个方面介绍了召回率的概念、计算、应用和提高方法。在实际应用中，需要根据具体场景选择合适的召回率指标和优化方法，以实现更好的模型性能。

月度归档：4 月 2024