Managing video storage on the web
Video is a tough asset to manage: streaming takes a lot of bandwidth, and caching is not straightforward. These issues are compounded when videos play on a loop, as in a kiosk display. If, for instance, a company has hundreds of devices playing 30 videos on repeat all day, every day, the repeated streaming could quickly overwhelm its network. By serving the videos from cache instead of streaming them, you incur the download cost only once, make subsequent plays faster, and make the videos available offline. To do this, you can take advantage of the browser’s storage capabilities, of which the Cache storage API and IndexedDB are the most suitable for storing video files. While both are good options, we’ll focus on the Cache storage API for its integration with the popular service worker library Workbox.
Caching video from a service worker
Because downloading and caching large assets like videos can be a particularly time- and processor-intensive task, you should do it in the background, off the main thread. Service workers are particularly useful for offloading caching tasks. They act as a proxy between the page and the network, allowing them to intercept requests and apply additional logic to the network response, for example, a caching strategy.
There are many different caching strategies, and each is designed to help in a different use case. For example, to serve a file from a cache if available, or fall back to the network if not, you can write the following code.
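A minimal sketch of such a handler is shown here. The function name cacheFirst and its two optional parameters are assumptions made for illustration; the parameters default to the browser's caches and fetch globals and exist only so the logic can be exercised outside a service worker.

```javascript
// Return the cached response when there is one, otherwise go to the network.
// cacheStorage and fetchFn are hypothetical injection points; in a service
// worker they default to the built-in `caches` and `fetch` globals.
async function cacheFirst(request, cacheStorage = caches, fetchFn = fetch) {
  const cached = await cacheStorage.match(request);
  return cached || fetchFn(request);
}

// Register the fetch handler only when this file runs as a service worker.
if (
  typeof ServiceWorkerGlobalScope !== 'undefined' &&
  self instanceof ServiceWorkerGlobalScope
) {
  self.addEventListener('fetch', (event) => {
    event.respondWith(cacheFirst(event.request));
  });
}
```

Note that this sketch only reads from the cache; something else still has to put the files there.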
Managing this for different asset types or URLs that require different caching strategies can be a repetitive and error-prone process. Workbox provides a set of tools, including routing helpers and caching strategies, that let you write service worker code in a more declarative and reusable way.
The previous strategy is called cache first. To write the same thing using Workbox, you’d include the following:
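As a rough sketch, the Workbox version could look like the following. The match callback (routing every request whose destination is a video) is an assumption chosen for this example; your own route may match on URL or file extension instead.

```javascript
import {registerRoute} from 'workbox-routing';
import {CacheFirst} from 'workbox-strategies';

// Route video requests through Workbox's built-in cache-first strategy.
// The matching rule below is illustrative, not prescribed by Workbox.
registerRoute(
  ({request}) => request.destination === 'video',
  new CacheFirst()
);
```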
Workbox provides similar recipes for other caching strategies and common service worker tasks, including integration with build tools like Webpack and Rollup.
With Workbox set up, you then need to choose when you’re going to cache your videos. Here, there are two approaches: eagerly on page load, or lazily when the video is requested.
Eager approach
Precaching is a technique in which files are saved to the cache during service worker installation, making them available as soon as the service worker is active. Workbox can automatically set up precaching for files it can access during your build process.
The following Workbox code can be used in your service worker to precache files:
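A minimal sketch, using only the pieces described in the list that follows, might be:

```javascript
import {addPlugins, precacheAndRoute} from 'workbox-precaching';
import {RangeRequestsPlugin} from 'workbox-range-requests';

// Let precached responses satisfy requests that carry a Range header.
addPlugins([new RangeRequestsPlugin()]);

// self.__WB_MANIFEST is swapped for the real file list at build time.
precacheAndRoute(self.__WB_MANIFEST);
```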
import(s) - Load the bindings required from the corresponding Workbox modules. Because service workers don’t support ESModules universally yet, your Workbox-powered service worker will need to be passed through a bundler for it to work in production.
RangeRequestsPlugin - Makes it possible for a request with a Range header to be fulfilled by a cached response. This is necessary because browsers typically use a Range header for media content.
addPlugins - Allows you to add Workbox plugins to every Workbox request.
precacheAndRoute - Adds entries to the precache list and creates a route to handle the corresponding fetch requests.
__WB_MANIFEST - A placeholder that the Workbox CLI (or build tool plugins) replaces with the precache manifest.
Pass your service worker into either the Workbox CLI or your build tool of choice, and configure how your precache should be generated. A workbox-config.js file, like the following, tells the CLI how to generate your service worker:
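One possible configuration is sketched here. Every path, the glob pattern, and the 50 MB size limit are hypothetical values picked for illustration; only the option names come from Workbox.

```javascript
// workbox-config.js (all values below are placeholders for your own project)
module.exports = {
  globDirectory: 'public/',
  globPatterns: ['**/*.mp4'],
  // 50 MB, expressed in bytes.
  maximumFileSizeToCacheInBytes: 50 * 1024 * 1024,
  swSrc: 'src/sw.js',
  swDest: 'public/sw.js',
};
```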
globDirectory - The root folder to start searching for precache files from.
globPatterns - The file patterns (“globs”) that should be precached.
maximumFileSizeToCacheInBytes - An upper limit for the size a file can be to be precached, in bytes.
swSrc - The location of the file that will be used to generate your service worker.
swDest - The destination for the generated service worker (it can be the same as the source file, but make sure self.__WB_MANIFEST is present for each run).
When the build process runs, a new version of the service worker is generated, and self.__WB_MANIFEST is replaced with a list of files, each with a hash to denote their revision:
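To give a sense of the shape of that list, here is an illustrative example. The file names are made up and the revision strings are placeholders, not real hashes; a real build fills in both.

```javascript
// Illustrative only: roughly what self.__WB_MANIFEST expands to after a build.
// URLs are hypothetical and the revisions are placeholders, not real hashes.
const precacheManifest = [
  {url: 'videos/intro.mp4', revision: '<revision hash of intro.mp4>'},
  {url: 'videos/outro.mp4', revision: '<revision hash of outro.mp4>'},
];
```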
Every time the build process runs, this list is rewritten with the current set of matching files and their current revision hashes. This ensures that whenever a file is added, removed, or changed, the service worker will update the cache on its next install.
Lazy approach
When you don’t have all of the videos available at build time, or only want to cache videos when they’re needed, you should employ a lazy approach. This approach requires the caching and serving to be separated; because only partial content is fetched from the network during video playback, caching files as they stream won’t work.
Caching the files
Caches can be created using caches.open() (the CacheStorage.open() method), and then files can be added to the cache using Cache.add() or Cache.addAll(). If your app receives a JSON list of videos to cache, they can be added to a video cache as follows:
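A sketch of that step, under a few assumptions: the endpoint /api/videos.json, the cache name video-cache, and the function name cacheVideos are all hypothetical, and the endpoint is assumed to return a plain JSON array of video URLs. The cacheStorage and fetchFn parameters default to the browser globals and are there only to make the function easy to test.

```javascript
// Hypothetical endpoint that returns a JSON array of video URLs,
// for example ["/videos/a.mp4", "/videos/b.mp4"].
const VIDEO_LIST_URL = '/api/videos.json';
// Hypothetical cache name.
const VIDEO_CACHE = 'video-cache';

async function cacheVideos(
  listUrl = VIDEO_LIST_URL,
  cacheStorage = caches,
  fetchFn = fetch
) {
  const response = await fetchFn(listUrl);
  if (!response.ok) {
    throw new Error(`Could not load video list: ${response.status}`);
  }
  const urls = await response.json();

  // addAll() downloads every URL in full and stores the responses.
  const cache = await cacheStorage.open(VIDEO_CACHE);
  await cache.addAll(urls);
  return urls;
}
```

Because addAll() rejects if any single download fails, you may prefer calling add() per file when partial success is acceptable.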
The advantage of this approach is that you can control the caching step independently of the service worker lifecycle, even from other web workers. The downside is that the storage management part is up to the developer: you need to write your own algorithm to track file changes, track the currently cached files in the browser, and manage file updates to ensure that only changed files get updated.
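One simple way to approach that bookkeeping is sketched below. It compares the wanted list with what the cache already holds, deletes entries that are no longer wanted, and downloads only the missing ones. The function name syncVideoCache is hypothetical, and the sketch assumes that a changed video gets a new URL (for example, a versioned file name); it does not detect a file whose content changed under the same URL.

```javascript
// Bring a cache in line with a list of wanted video URLs.
// Assumption: a changed video is published under a new URL.
async function syncVideoCache(wantedUrls, cache, baseUrl) {
  // Resolve relative URLs so they compare equal to Request.url values.
  const wanted = new Set(wantedUrls.map((u) => new URL(u, baseUrl).href));
  const cachedRequests = await cache.keys();
  const cached = new Set(cachedRequests.map((request) => request.url));

  const stale = [...cached].filter((url) => !wanted.has(url));
  const missing = [...wanted].filter((url) => !cached.has(url));

  // Drop what is no longer listed, then fetch only what is not cached yet.
  await Promise.all(stale.map((url) => cache.delete(url)));
  await cache.addAll(missing);
  return {added: missing, removed: stale};
}
```

In a page or worker, you could call it as syncVideoCache(urls, await caches.open('video-cache'), self.location.href), where the cache name is again a placeholder.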
Serving cached video files
A service worker runtime caching strategy, like cache first, can then be used to serve the previously cached video files:
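A sketch of such a route, built from the pieces described in the list that follows, could be written like this. The cache name video-cache and the destination-based match rule are assumptions; the cache name must match whatever name you used when caching the files.

```javascript
import {registerRoute} from 'workbox-routing';
import {CacheFirst} from 'workbox-strategies';
import {CacheableResponsePlugin} from 'workbox-cacheable-response';
import {RangeRequestsPlugin} from 'workbox-range-requests';

registerRoute(
  // Illustrative match rule: handle requests made by video elements.
  ({request}) => request.destination === 'video',
  new CacheFirst({
    // Hypothetical name; use the cache the videos were stored in.
    cacheName: 'video-cache',
    plugins: [
      // Only full 200 responses may be cached, never partial 206 ones.
      new CacheableResponsePlugin({statuses: [200]}),
      // Slice the cached response to answer Range requests.
      new RangeRequestsPlugin(),
    ],
  })
);
```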
import(s) - Loads the bindings required from the corresponding Workbox modules.
registerRoute - Routes requests to functions (caching strategies and plugins) that provide responses.
CacheFirst - Caching strategy that fulfills the request from the cache, if available, otherwise fetches it from the network and updates the cache.
CacheableResponsePlugin - Used to indicate which status codes or headers need to be present for the response to be cacheable. Be sure to only include 200 statuses for routes caching video to avoid partial content responses (206) being cached as videos are streamed.
RangeRequestsPlugin - Plugin that makes it possible for a request with a Range header to be fulfilled by a cached response. This is necessary because browsers typically use a Range header for media content.
Optimizing video loading is an important task for apps that do intensive streaming. By leveraging the browser’s Cache storage API and Workbox, you can make this otherwise hard task manageable, saving your users’ bandwidth, reducing server load, achieving faster video playback, and letting your videos run even when offline.