书名：Python Web Scraping Cookbook
作者名：Michael Heydt
本章字数：92字
更新时间：2025-02-26 12:46:24

Working with Images, Audio, and other Assets

In this chapter, we will cover:

Downloading media content on the web
Parsing a URL with urllib to get the filename
Determining type of content for a URL
Determining a file extension from a content type
Downloading and saving images to the local file system
Downloading and saving images to S3
Generating thumbnails for images
Taking website screenshots with Selenium
Taking a website screenshot with an external service
Performing OCR on images with pytessaract
Creating a Video Thumbnail
Ripping an MP4 video to an MP3

本周热推：

一本书读懂24种互联网思维网络是怎样连接的一本书读懂TCP/IP 网络工程师红宝书：思科华为华三实战案例荟萃华为HCIA-Datacom认证指南

上一章目录下一章