Revision as of 22:42, 3 July 2017

Note: The Safe Browsing feature in Firefox has been renamed to Phishing Protection, but it's still known as Safe Browsing internally.

Download Protection and Tracking protection have their own separate pages.

History

Google Safe Browsing was an anti-phishing extension released by Google on labs.google.com in December 2005. Google has released this extension to the Mozilla Foundation under MPL 1.1/GPL 2.0/LGPL 2.1 in order that it might be used as part of Firefox if desired. We've landed this change on the trunk as a global extension as of 7 March 2006. You can read the discussion that lead up to to its integration in https://bugzilla.mozilla.org/show_bug.cgi?id=329292

Google started migrating their Safe Browsing to version 4 of the protocol in 2015. We are currently working on integrating V4 in our code base with a incremental approach. That is, we will be landing V4 patches progressively and leave the V2 stack up and running until the V4 is extensively tested and the Shavar server has been upgraded to V4. In other words, V2 and V4 will co-exist for a while to ensure we don't break Safe Browsing. See the V4 implementation plan for the milestones and bugs involved.

Prefs

browser.safebrowsing.blockedURIs.enabled: enable the plugin stability blocking (no override or UI)
browser.safebrowsing.debug: show debugging info from the JavaScript list update code on the command line
browser.safebrowsing.id: what SAFEBROWSING_ID in gethashURL and updateURL maps to
browser.safebrowsing.malware.enabled: enable malware protection (includes unwanted as well)
browser.safebrowsing.phishing.enabled: enable phishing protection
browser.safebrowsing.provider.google.gethashURL: server endpoint for completions of malware and phishing lists
browser.safebrowsing.provider.google.lists: list of tables coming from the Google Safe Browsing service
browser.safebrowsing.provider.google.reportURL: probably unused
browser.safebrowsing.provider.google.updateURL: server endpoint for malware and phishing list updates
browser.safebrowsing.provider.google.lastupdatetime: timestamp (in ms) of when the last list update happened.
browser.safebrowsing.provider.google.nextupdatetime: timestamp (in ms) of when the list should next be downloaded.
browser.safebrowsing.reportMalwareMistakeURL: destination for the "This isn't an attack site" button (after ignoring the interstitial warning)
browser.safebrowsing.reportPhishMistakeURL: destination for the "This isn't a web forgery" button (after ignoring the interstitial warning)
browser.safebrowsing.reportPhishURL: destination for the "Help | Report Web Forgery" menu item
urlclassifier.blockedTable: list of tables to use for the plugin stability blocking
urlclassifier.disallow_completions: list of tables for which we never call gethash
urlclassifier.gethashnoise: the number of fake entries to add to any gethash calls. Defaul value: 4. Maximum value: 999 (beyond, the Google request fails with HTTP 400).
urlclassifier.gethash.timeout_ms: the timeout after which gethash requests should be aborted
urlclassifier.malwareTable: list of tables to use when looking for malware (they need to be named *-malware-* or *-unwanted-*)
urlclassifier.max-complete-age: the maximum amount of time in seconds that a complete hash will be considered fresh and allowed to match
urlclassifier.phishTable: list of tables to use when looking for phishing (they need to be named *-phish-*)
urlclassifier.skipHostnames: comma-separated list of hostnames to exempt from Safe Browsing checks (hidden, only for temporary hotfix purposes)

Documentation

Official Google documentation:
- Safe Browsing protocol: v2.2 and v4
- User warning requirements
- Internal documentation available under NDA
- Android API (requires Google Play Services 9.4)
- Built-in support in WebView (public in Android O, private in Android N)
Design Documentation
- Server Spec
- Client Spec
SUMO
Overview of how Safe Browsing works in Firefox
Chromium
- Design documentation
- Implementation overview
Google's advice to site owners:

Engineering

Product/Component: Toolkit/Safe Browsing

~~Tracking bug~~ (deprecated, do not use)
The Firefox implementation is split into a few parts:
- browser/components/safebrowsing/ (front-end tests)
- netwerk/base/nsChannelClassifier
- toolkit/components/url-classifier/ (includes the list manager)
Local store is in:
- ~/.cache/mozilla/firefox/XXXX/safebrowsing/ on Linux
- ~/Library/Caches/Firefox/Profiles/XXXX/safebrowsing/ on Mac
- C:\Users\XXXX\AppData\Local\mozilla\firefox\profiles\XXXX\safebrowsing\ on Windows
itisatrap.org test pages
Telemetry dashboard

Code walkthrough

Both nsBaseChannel::Open() and nsBaseChannel::AsyncOpen() ask for the channel to be "classified" by nsChannelClassifier. There is also a local-only classification that is requested by tracking protection.

The classifier determines the type of URL that it is and then returns the appropriate NS_ERROR code. That causes the channel to be cancelled with that error code.

When the classification state of the page changes, the appropriate UI is shown.

QA

Test pages
- Malware, phishing, and unwanted software hard-coded test URLs
- Phishtank (real phishing sites)
- Google test pages (we don't implement: Clank Warnings, Client-side phishing detection, Bad IP Warnings)
- Static test pages for specific bugs
Meta QA bug
Info on why certain URLs are blocked
Script to dump the contents of the local store
UI tests (Marionette)

To turn on debugging output, export the following environment variables:

MOZ_LOG_FILE=/tmp/safebrowsing.log
MOZ_LOG="UrlClassifierDbService:5,nsChannelClassifier:5,UrlClassifierProtocolParser:5,UrlClassifierStreamUpdater:5,UrlClassifierPrefixSet:5"

and also see the browser.safebrowsing.debug pref to see debugging output from the JS pieces of Safe Browsing.

Telemetry

Alerts are sent to safebrowsing-telemetry@mozilla.org.

Performance
- URLCLASSIFIER_ASYNC_CLASSIFYLOCAL_TIME: time spent inside AsyncClassifyLocalWithTables()
- URLCLASSIFIER_CLASSIFYLOCAL_TIME: time spent inside ClassifyLocalWithTables()
- URLCLASSIFIER_CL_CHECK_TIME: how long a Safe Browsing lookup took
- URLCLASSIFIER_CL_KEYED_UPDATE_TIME: how long table updates takes
- URLCLASSIFIER_LOOKUP_TIME_2: time spent in the dbservice while doing a lookup
- URLCLASSIFIER_PS_CONSTRUCT_TIME: time spent constructing a PrefixSet
- URLCLASSIFIER_PS_FALLOCATE_TIME: time spent allocating a PrefixSet
- URLCLASSIFIER_PS_FILELOAD_TIME: time spent loading PrefixSet from disk
- URLCLASSIFIER_SHUTDOWN_TIME: time spent in the URL Classifier shutdown code
- URLCLASSIFIER_VLPS_CONSTRUCT_TIME: time spent constructing a variable-length PrefixSet
- URLCLASSIFIER_VLPS_FALLOCATE_TIME: time spent allocating a variable-length PrefixSet
- URLCLASSIFIER_VLPS_FILELOAD_TIME: time spent loading a variable-length PrefixSet from disk
Server-related
- URLCLASSIFIER_COMPLETE_REMOTE_STATUS2: HTTP status code returned by the gethash server
- URLCLASSIFIER_COMPLETE_SERVER_RESPONSE_TIME: response time from the completion server
- URLCLASSIFIER_COMPLETE_TIMEOUT2: whether or not a client timed out while contacting the gethash server
- URLCLASSIFIER_COMPLETION_ERROR: whether a V4 completion result couldn't be parsed or contained an unknown threat type
- URLCLASSIFIER_UPDATE_ERROR: whether or not an error was encountered while processing an update
- URLCLASSIFIER_UPDATE_REMOTE_NETWORK_ERROR: update errors while downloading updates
- URLCLASSIFIER_UPDATE_REMOTE_STATUS2: HTTP status code returned by the update server
- URLCLASSIFIER_UPDATE_SERVER_RESPONSE_TIME: response time from the update server
- URLCLASSIFIER_UPDATE_TIMEOUT: whether or not a client timed out while contacting the update server
Database size
- URLCLASSIFIER_LC_COMPLETIONS: number of entries in the completion cache
- URLCLASSIFIER_LC_PREFIXES: number of entries in the prefix cache
User interface
- SECURITY_UI: number of interstitial pages shown (malware, phishing, unwanted) either in a top-level page or in a frame and the number of times users click on "Ignore this warning", "Get me out of here" or "Why is this blocked?"
V4 quality assurance
- APPLICATION_REPUTATION_ALLOWLIST_MATCH: comparison between the V2 and V4 lists for the application reputation whitelist
- APPLICATION_REPUTATION_BLOCKLIST_MATCH: comparison between the V2 and V4 lists for the application reputation blacklist
- URLCLASSIFIER_NEGATIVE_CACHE_DURATION: negative cache duration received in fullhash response
- URLCLASSIFIER_POSITIVE_CACHE_DURATION: positive cache duration received in fullhash response
- URLCLASSIFIER_VLPS_LOAD_CORRUPT: whether or not a variable-length PrefixSet loaded from disk is corrupt
- URLCLASSIFIER_VLPS_LONG_PREFIXES: length of the variable-length prefixes that are sent by Google

Links

Google reporting forms:
- Malware
- Phishing -- Firefox-specific
- Phishing error (false positive) -- Firefox-specific
StopBadware.org form:
- Malware error
API key and account details (internal access only)

@@ Line 132: / Line 132: @@
 ** [https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2017-04-19&keys=__none__!__none__!__none__&max_channel_version=nightly%252F55&measure=APPLICATION_REPUTATION_ALLOWLIST_MATCH&min_channel_version=null&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2017-03-08&table=0&trim=1&use_submission_date=0 APPLICATION_REPUTATION_ALLOWLIST_MATCH]: comparison between the V2 and V4 lists for the application reputation whitelist
 ** [https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2017-04-19&keys=__none__!__none__!__none__&max_channel_version=nightly%252F55&measure=APPLICATION_REPUTATION_BLOCKLIST_MATCH&min_channel_version=null&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2017-03-08&table=1&trim=1&use_submission_date=0 APPLICATION_REPUTATION_BLOCKLIST_MATCH]: comparison between the V2 and V4 lists for the application reputation blacklist
-** [https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2017-04-19&keys=__none__!__none__!__none__&max_channel_version=nightly%252F55&measure=URLCLASSIFIER_MATCH_RESULT&min_channel_version=null&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2017-03-08&table=1&trim=1&use_submission_date=0 URLCLASSIFIER_MATCH_RESULT]: comparison between V2 and V4 in URL lookups
-** [https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2017-04-19&keys=__none__!__none__!__none__&max_channel_version=nightly%252F55&measure=URLCLASSIFIER_MATCH_THREAT_TYPE_RESULT&min_channel_version=null&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2017-04-13&table=1&trim=1&use_submission_date=0 URLCLASSIFIER_MATCH_THREAT_TYPE_RESULT]: comparison of threat types between V2 and V4 for full URL matches
 ** [https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2017-04-19&keys=__none__!__none__!__none__&max_channel_version=nightly%252F55&measure=URLCLASSIFIER_NEGATIVE_CACHE_DURATION&min_channel_version=null&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2017-03-23&table=1&trim=1&use_submission_date=0 URLCLASSIFIER_NEGATIVE_CACHE_DURATION]: negative cache duration received in fullhash response
 ** [https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2017-04-18&keys=__none__!__none__!__none__&max_channel_version=nightly%252F55&measure=URLCLASSIFIER_POSITIVE_CACHE_DURATION&min_channel_version=null&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2017-03-22&table=1&trim=1&use_submission_date=0 URLCLASSIFIER_POSITIVE_CACHE_DURATION]: positive cache duration received in fullhash response

Security/Safe Browsing: Difference between revisions

Revision as of 22:42, 3 July 2017

Contents

History

Prefs

Documentation

Engineering

Code walkthrough

QA

Telemetry

Links

Navigation menu

Security/Safe Browsing: Difference between revisions

Revision as of 22:42, 3 July 2017

History

Prefs

Documentation

Engineering

Code walkthrough

QA

Telemetry

Links

Navigation menu

Search