Necko/Cache/Plans: Difference between revisions

← Older edit

Necko/Cache/Plans (view source)

Revision as of 18:40, 14 June 2022

2,682 bytes added , 14 June 2022

dummy edit

Edguloien

27

edits

@@ Line 1: / Line 1: @@
-This page is a WIP.
+= New Cache Plans =
-== Locking / Initialization Performance / Asynchronicity ==
+We have decided to rewrite our HTTP disk cache.
-* {{bug|701909}}
-* {{bug|716293}}
-* {{bug|716172}}
-* {{bug|717761}}
-* {{bug|722034}}
-== Versioning / Upgrade Safety ==
+== People ==
-* {{bug|715752}}
-== On-Disk Layout Changes ==
+The design team will be responsible for coming up with a design for the new disk cache. The design should be thorough and well-documented. Once the design team is satisfied with an initial design document, the implementation team will start implementing.
-=== Crash Recovery ===
-* {{bug|105843}}
-=== Layout Optimization ===
-* {{bug|715714}}
-== Miscellaneous Concerns ==
+Design team:
-* {{bug|648605}} (eviction strategy)
+* Michal Novotny
+* Taras Glek
+* Steve Workman
+* Honza Bambas
+* Nick Hurley
+* Brian Bondy
+* Doug Turner
+* Patrick McManus
+* Steve Workman
+<br>Implementation team:
+* Honza Bambas
+* Michal Novotny
+== Primary Design Goals ==
+This section documents issues that need to be addressed in the new cache's design.
+* Version API for the cache so we can update easily.
+* All APIs should be async. No main-thread locking or i/o at all.
+* A crash or abnormal program termination should not invalidate the entire cache.
+* Support gzip compression. Meta-data should say whether a file is gzip'd or not, can choose to write compressed or uncompressed data on a per-file basis at runtime. Pass through files gzip'd from the network.
+* Make use of fallocate.
+* Minimize API surface, especially for APIs exposed to JS/extensions. All exposed APIs should have a clear, safe use case.
+* Consider eliminating memory cache.
+* Competing ideas:
+** Temporal layout so that sub-resources are together.
+** Don't over-optimize on-disk storage, use one file per entry and let OS optimize.
+* Layered design should include XPCOM API, C++ API exposed to Gecko, middle layer with general cache logic, and back-end allowing for alternative on-disk formats.
+* Separate services for HTTP and offline cache? Find a way to make these use cases work well without over-complicating code.
+* Browser should behave properly with disk cache entirely disabled.
+* Allow for effectively racing cache against network, so as to not wait serially.
+* Use this very same cache for more general meta-like data, e.g. cache hosts for DNS prewarms, appcache namespaces + its other data and versioning, any useful host specific data we now getter in memory and throw away after restart (SPDY preference, TLS tolerance, pipeline successful test, etc...)
+== Success Metrics ==
+This section documents the ways in which we'll determine whether or not the new cache design is a success.
+* Should not be possible to trigger main-thread i/o.
+* Create telemetry for with-cache and without-cache. For top 50% cache should be faster than no cache, for low 50% cache should be faster than no cache.
+== API ==
+This section documents the APIs for interacting with the new disk cache.
+[[Necko/Cache/Plans/Draft Proposal|API Changes proposal]]
+=== XPCOM APIs (exposed to JS) ===
+=== C++ APIs (exposed to Necko) ===
+== Locking ==
+This section describes how locking will work in the new disk cache. Ideally this should document every lock that will be necessary in the new cache.
+== On-Disk Layout ==
+This section describes the on-disk layout of the disk cache. It may describe a default on-disk layout and any number of alternatives required for the first revision of the new disk cache.