- Python Web Scraping Cookbook
- Michael Heydt
- 92字
- 2021-06-30 18:44:11
Working with Images, Audio, and other Assets
In this chapter, we will cover:
- Downloading media content on the web
- Parsing a URL with urllib to get the filename
- Determining type of content for a URL
- Determining a file extension from a content type
- Downloading and saving images to the local file system
- Downloading and saving images to S3
- Generating thumbnails for images
- Taking website screenshots with Selenium
- Taking a website screenshot with an external service
- Performing OCR on images with pytessaract
- Creating a Video Thumbnail
- Ripping an MP4 video to an MP3