Copyright ©2000 W3C® (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.
This is the specification of the Platform for Privacy Preferences (P3P). This document, along with its normative references, includes all the specification necessary for the implementation of interoperable P3P applications.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. The latest status of this document series is maintained at the W3C.
This is a W3C Working Draft for review by W3C members and other interested parties. This document has been produced by the P3P Specification Working Group as part of the P3P Activity, and it is the third revision of the last call draft issued the 2nd of November 1999 (http://www.w3.org/TR/1999/WD-P3P-19991102). A change log is included at the end of this document for convenience. The last call period is expected to end on April 30, 2000. A revised version of this specification is expected to advance toward W3C Recommendation status after two interoperable implementations have been demonstrated.
This Working Draft includes an extension mechanism that can be used to extend the P3P vocabulary. The Working Group is particularly interested in feedback on how to improve this mechanism as well as examples of extensions that people would like to propose. These examples may be useful for improving the design of the extension mechanism. In addition, the Working Group may consider incorporating some of these ideas into the P3P vocabulary so that they need not be introduced later as extensions. The introduction to this document (Section 1) provides additional information about the status of this working draft and future versions of P3P.
While this document is in last call, it is still a draft document that may be updated, replaced, or obsoleted by other documents at any time. It is therefore inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress." A list of current W3C working drafts can be found at http://www.w3.org/TR/.
Please send comments to www-p3p-public-comments@w3.org (archived at http://lists.w3.org/Archives/Public/www-p3p-public-comments/).
The Platform for Privacy Preferences Project (P3P) enables Web sites to express their privacy practices in a standard format that can be retrieved automatically and interpreted easily by user agents. P3P user agents will allow users to be informed of site practices (in both machine- and human-readable formats) and to automate decision-making based on these practices when appropriate. Thus users need not read the privacy policies at every site they visit.
Although P3P provides a technical mechanism for ensuring that users can be informed about privacy policies before they release personal information, it does not provide a technical mechanism for making sure sites act according to their policies. Products implementing this specification MAY provide some assistance in that regard, but that is up to specific implementations and outside the scope of this specification. However, P3P is complementary to laws and self-regulatory programs that can provide enforcement mechanisms. In addition, P3P does not include mechanisms for transferring data or for securing personal data in transit or storage. P3P may be built into tools designed to facilitate data transfer. These tools should include appropriate security safeguards.
The P3P1.0 specification defines the syntax and semantics of P3P privacy policies, and the mechanisms for associating policies with Web resources. P3P policies consist of statements made using the P3P vocabulary for expressing privacy practices. P3P policies also reference elements of the P3P base data schema -- a standard set of data elements that all P3P user agents should be aware of. The P3P specification includes a mechanism for defining new data elements and data sets, and a simple mechanism that allows for extensions to the P3P vocabulary.
P3P version 1.0 is a protocol designed to inform Web users of the data-collection practices of Web sites. It provides a way for a Web site to encode its data-collection and data-use practices in a machine-readable XML format known as a P3P policy. The P3P specification defines:
The goal of P3P version 1.0 is twofold. First, it allows Web sites to present their data-collection practices in a standardized, machine-readable, easy-to-locate manner. Second, it enables Web users to understand what data will be collected by sites they visit, how that data will be used, and what data/uses they may "opt-out" of or "opt-in" to.
As an introduction to P3P, let us consider one common scenario which makes use of P3P. Sheila has decided to check out a store called CatalogExample, located at http://www.catalog.example.com/. Let us assume that CatalogExample has placed P3P policies on all their pages, and that Sheila is using a Web browser with P3P built in.
Sheila types the address for CatalogExample into her Web browser. When CatalogExample's server returns their homepage, it also returns the P3P privacy policy which applies to that page. The policy states that the only data the site collects on its home page is the data found in standard HTTP access logs. Now Sheila's Web browser checks this policy against the preferences Sheila has given it. Is this policy acceptable to her, or should she be notified? Let's assume that Sheila has told her browser that this is acceptable. In this case, the homepage is displayed normally, with no pop-up messages appearing. Perhaps her browser displays a small icon somewhere along the edge of its window to tell her that a privacy policy was given by the site, and that it matched her preferences.
Next, Sheila clicks on a link to the site's online catalog. The catalog section of the site has some more complex software behind it. This software uses cookies to implement a "shopping cart" feature. Since more information is being gathered in this section of the web site, the Web server sends a new P3P policy to Sheila's browser. Again, let's assume that this policy matches Sheila's preferences, so she gets no pop-up messages. Sheila continues and selects a few items she wishes to purchase. Then she proceeds to the checkout page.
The checkout page of CatalogExample requires some additional information: Sheila's name, address, credit card number, and telephone number. The web site sends a new P3P policy that describes the data that is collected here and states that her data will be used only for completing the current transaction, her order.
Sheila's browser examines this P3P policy. Imagine that Sheila has told her browser that she wants to be warned whenever a site asks for her phone number. In this case, the browser will pop up a message saying that this Web site is asking for her phone number, and then explains the contents of the P3P statement. Sheila can then decide if this is acceptable to her. If it is acceptable, she can continue with her order; otherwise she can cancel the transaction.
Alternatively, Sheila could have told her browser that she wanted to be warned only if a site is asking for her telephone number and was going to give it to third parties and/or use it for uses other than completing the current transaction. In that case, she would have received no prompts from her browser at all, and she could proceed with completing her order.
Note that this scenario describes one hypothetical implementation of P3P. Other types of user interfaces are also possible.
P3P policies use an XML encoding of the P3P vocabulary to identify the legal entity making the representation of privacy practices in a policy, enumerate the types of data or data elements collected, and explain how the data will be used. In addition, policies identify the data recipients, and make a variety of other disclosures including information about dispute resolution, and the address of a site's human-readable privacy policy. P3P policies must cover all relevant data elements and practices (but note that legal issues regarding law enforcement demands for information are not addressed by this specification; it is possible that a site that otherwise abides by its policy of not redistributing data to others may be required to do so by force of law). P3P declarations are positive, meaning that sites state what they do, rather than what they do not do. The P3P vocabulary is designed to be descriptive of a site's practices rather than simply an indicator of compliance with a particular law or code of conduct. However, user agents may be developed that can test whether a site's practices are compliant with a law or code.
P3P policies represent the practices of the site. Intermediaries such as telecommunication providers, Internet service providers, proxies and others may be privy to the exchange of data between a site and a user, but their practices may not be governed by the site's policies.
P3P1.0 user agents can be built into web browsers, browser plug-ins, or proxy servers. They can also be implemented as Java applets or JavaScript; or built into electronic wallets, automatic form-fillers, or other user data management tools. P3P user agents look for P3P headers in HTTP responses and in P3P META tags embedded in HTML content. These special headers and tags indicate the location of a relevant P3P policy. User agents can fetch the policy from the indicated location, parse it, and display symbols, play sounds, or generate user prompts that reflect a site's P3P privacy practices. They can also compare P3P policies with privacy preferences set by the user and take appropriate actions. P3P can perform a sort of "gate keeper" function for data transfer mechanisms such as electronic wallets and automatic form fillers. A P3P user agent integrated into one of these mechanisms would retrieve P3P policies, compare them with user's preferences, and authorize the release of data only if a) the policy is consistent with the user's preferences and b) the requested data transfer is consistent with the policy. If one of these conditions is not met, the user might be informed of the discrepancy and given an opportunity to authorize the data release themselves.
Web sites can implement P3P1.0 on their servers by translating their human-readable privacy policies into P3P syntax and configuring their servers to advertise the location of the P3P policy. Automated tools can assist sites in performing this translation. Many HTTP1.1 servers can be configured to support P3P1.0 without requiring the installation of additional software. Servers may be configured to insert a P3P extension header into all HTTP responses that indicates the location of a site's P3P policy, using the HTTP Extension Framework. Alternatively, they can be configured to insert this information into HTML content as a META tag. Web sites have some flexibility in how they use P3P: they can opt for one P3P policy for their entire site or they can designate different policies for different parts of their sites. A P3P policy MUST cover all data generated or exchanged as part of a site's HTTP interactions with visitors. In addition, some sites may wish to write policies that cover all data an entity collects, regardless of how the data is collected.
The P3P Specification Working Group removed significant sections from earlier drafts of the P3P1.0 specification in order to facilitate rapid implementation and deployment of a P3P first step. The group envisions the release of future versions of the P3P specification after P3P1.0 is deployed. This specification would likely include improvements based on feedback from implementation and deployment experience as well as four major components that were part of the original P3P vision but not included in P3P1.0:
This document, along with its normative references, includes all the specification necessary for the implementation of interoperable P3P applications.
The [ABNF] notation used in this specification is specified in RFC2234 and summarized in Appendix 7. However, note that such syntax is only a grammar representative of the XML syntax: all the syntactic flexibilities of XML are also implicitly included; e.g. whitespace rules, quoting using either single quote (') or double quote ("), character escaping, comments, case sensitivity, order of attributes.
The following key words are used throughout the document and should be read as interoperability requirements. This specification uses words as defined in RFC2119 [KEY] for defining the significance of each particular requirement. These words are:
Referencing a P3P policy is one of the first steps in the operation of the P3P protocol. Services use policy references to state what policy applies to a specific URI or set of URIs. User agents will use policy references to locate the privacy policy which applies to a page, so that they can process that policy for the benefit of their user.
Policy references are extensively as a performance optimization. Privacy policies are typically several kilobytes of data, while a URI which references a privacy policy is typically less than 50 bytes. In addition to the bandwidth savings, policy references also reduce the need for computation: policies can be uniquely associated with URIs, so that a user agent need only parse and process a policy once rather than process it with every document to which the policy applies. Furthermore, by placing the information about relevant policies in a centralized location, Web site administration is simplified.
A policy reference file is used to associate a policy with a URI. The location of the policy reference file can be indicated using one of two mechanisms. This can be done through an HTML LINK tag, or through the HTTP Extension Framework. The policy reference file specifies the P3P policy that applies to that document. The policy reference file is an [RDF]/[XML] file that can specify the policy for a single Web document, portions of a Web site, or for an entire site. The policy reference file may refer to one or more P3P policies; this allows for a single reference file to cover an entire site, even if different P3P policies apply to different portions of the site.
Note that policies are applied at the level of HTTP entities. An entity, retrieved by fetching a URI, has a P3P policy associated with it. A "page" from the user's perspective may be composed of multiple HTTP entities; each entity may have its own P3P policy associated with it. As a practical note, however, placing many different P3P policies on different entities on a single page may make rendering the page and informing the user of the relevant policies difficult for user agents. Additionally, services SHOULD attempt to craft their policy reference files such that a single policy reference file covers any given "page"; this will speed up the user's browsing experience.
This document does not specify how P3P policies may be associated with documents retrieved by means other than HTTP.
This section describes the syntax used to make policy references using the two supported mechanisms.
P3P makes use of the HTTP Extension Framework [HTTP-EXT]. The HTTP Extension Framework allows new HTTP headers to be defined and used. All HTTP headers associated with a given extension in a request or response are to be prefixed by an arbitrary two-digit namespace identifier. This prefix may be chosen by implementations on a per-message basis. This guarantees a unique namespace for the extension's headers. In addition, the extension must identify itself (with a URI) when it declares the namespace.
The HTTP Extension Framework requires a globally unique URI identifying the extension (the extension declaration). The P3P extension declaration is the following URI:
http://www.w3.org/2000/P3Pv1
Any document retrieved by HTTP may point to a policy reference file through the use of a new response header, the PolicyRef header. The PolicyRef header contains the URI of a policy reference file, which will state the P3P policy covering the document that pointed to the reference file, and possibly others as well. This URI MUST NOT be used for any other purpose beyond identifying and referencing P3P policies.
The P3P extension declaration and policy header SHOULD be inserted whenever a P3P-enabled server responds to a relevant request, including when it responds to HEAD and OPTIONS requests.
As it is possible for non-P3P-enabled user agents to properly interpret and process responses which include the P3P policy reference, P3P is an "optional" extension in the terms of the HTTP Extension Framework. Since policy references may be processed by agents anywhere along the response chain, P3P is an end-to-end HTTP extension. Thus, the header used to declare the P3P extension will be Opt.
The header syntax is:
[1] | policy-reference-header |
= |
nsprefix `-PolicyRef: ` URI |
Here, URI is defined as per RFC 2396 [URI]. nsprefix is the two-digit namespace declaration selected for the P3P headers in this message, according to [HTTP-EXT]. It may be any two-digit number that does not conflict with other namespace declarations in the response. |
In keeping with the rules for other HTTP headers, the PolicyRef portion of this header may be written in any case.
1. Client makes a GET request.
GET /index.html HTTP/1.1 Host: catalog.example.com Accept: */* Accept-Language: de, en User-Agent: WonderBrowser/5.2 (RT-11)
2. Server returns content and the PolicyRef header pointing to the policy of the page.
HTTP/1.1 200 OK Opt: "http://www.w3.org/2000/P3Pv1"; ns=11 11-PolicyRef: http://catalog.example.com/P3P/PolicyReferences.xml Content-Type: text/html Content-Length: 7413 Server: CC-Galaxy/1.3.18
Servers may serve HTML content with embedded link tags that indicate the location of the relevant P3P policy reference file. This use of P3P does not require a P3P-aware server; content may be modified to include the embedded link tags without requiring any changes to the way the server operates.
The link tag encodes the information that could be expressed using the P3P PolicyRef header. The link tag takes the following form:
[2] | p3p-link-tag |
= |
`<link rel="P3Pv1" href="` URI `">`` |
Here, URI is defined as per RFC 2396 [URI]. |
For example, the policy reference expressed in Example 2.1 using HTTP headers could be expressed equally well by including in the web page http://catalog.example.com/index.html the following piece of HTML:
<link rel="P3Pv1" href="http://catalog.example.com/P3P/PolicyReferences.xml">
Note that if user agents handle HTML, they MUST handle both mechanisms (policy references in HTTP headers or in link tags) interchangeably; neither of the two mechanisms overrides the other. See also the requirements for non-ambiguity.
A policy reference file is used to associate P3P policies with certain regions of URI-space. Regardless of the mechanism documents use to link to a policy reference file, the syntax of that reference file remains the same. The policy reference file is used to make any or all of the following statements:
The first four statements are made in the body of the policy reference file. The last is made using HTTP expiration headers on the policy reference file.
Consider the case of a Web site wishing to make the following statements:
These statements could be represented by the following [RDF]:
<POLICY-REFERENCES xmlns="http://www.w3.org/2000/P3Pv1" xmlns:web="http://www.w3.org/1999/02/22-rdf-syntax-ns#" > <web:RDF> <POLICY-REF web:about="/P3P/Policy1.p3p"> <PREFIX>/</PREFIX> <EXCLUDE>/catalog/</EXCLUDE> <EXCLUDE>/cgi-bin/</EXCLUDE> <EXCLUDE>/servlet/</EXCLUDE> </POLICY-REF> <POLICY-REF web:about="/P3P/Policy2.p3p"> <PREFIX>/catalog/</PREFIX> </POLICY-REF> <POLICY-REF web:about="/P3P/Policy3.p3p"> <PREFIX>/cgi-bin/</PREFIX> <PREFIX>/servlet/</PREFIX> <EXCLUDE>/servlet/unknown</EXCLUDE> </POLICY-REF> </web:RDF> </POLICY-REFERENCES>
To indicate that the claims made by this policy reference file are good for 8 hours, the origin server serving this page would return a Cache-Control: max-age=28800 header with this file. Alternatively, the origin server could generate an Expires header dated 8 hours past the Date header in the response.
This section defines the syntax and semantics of P3P policy reference files. All policies MUST be encoded using [UTF-8]. P3P servers MUST encode their policy references using this syntax. P3P user agents MUST be able to parse this syntax.
One significant point to make about the syntax of policy reference files is that the syntax defined here does not have an extension mechanism. The syntax for P3P policies has a powerful extension mechanism, but that mechanism is not supported for policy reference files.
The lifetime of a policy reference file tells user agents how long they can rely on the claims made in the reference file. For example, if a policy reference file has a lifetime of 16 hours, then a user agent need not reload that file for 16 hours, and can assume that the references made in that reference file are good for 16 hours. All of the policy references made in a single policy reference file will receive the same lifetime. The only way to specify different lifetimes for P3P policies is to use separate policy reference files for each policy.
The lifetime of a policy reference file is determined by the HTTP cache control headers served with the reference file. However, user agents MUST NOT use heuristic expiration based on last-modified to compute a lifetime for the reference file. User agents MUST compute a lifetime for the policy reference file based on Expires, Cache-Control, or Pragma headers served with the file if they are available. The semantics of these headers are defined by [HTTP]. If none of these headers is available, the lifetime MUST be set to 24 hours from the time the document was sent from the origin server. Origin servers SHOULD use one of the headers listed above to give an explicit lifetime for their policy reference files.
The possible presence of caches in the network and the heuristic expiration mechanism in HTTP considerably complicates lifetime considerations. Consider the case of policy reference files which have no explicit cache lifetime defined by the origin server (i.e., none of the headers listed above are included in the response). A network caches will, in all likelihood, compute a cache lifetime for the policy reference file based on its last-modified date; the resulting cache lifetime could be significantly longer that 24 hours. If a user agent then retrieves this policy reference file from an HTTP 1.0 cache, the user agent has no way to know how long the reference file may have been in the cache. It would then be impossible for the user agent to determine if the reference file's lifetime has already expired, or when it will expire. HTTP 1.1 caches improve the situation somewhat, as HTTP 1.1-compliant caches MUST send an Age header when serving a request from their cache. However, even this is not sufficient; the cache could return a file with an age exceeding the 24-hour lifetime defined here, resulting in a useless policy reference file. To avoid these problems, user agents MUST insure that they load a fresh copy of the policy reference file when it is fetched. Thus, a user agent MUST include either a Pragma: no-cache or a Cache-Control: no-cache request-header when fetching a policy reference file. The former is suggested for compatability with HTTP 1.0 caches.
Note that it is impossible for a client to accurately predict the amount of latency that may affect an HTTP request. Thus, if the policy reference file covering a request is going to expire soon, clients MAY wish to consider warning their users and/or revalidating the policy reference file before continuing with the request.
The POLICY-REFERENCES element contains a complete policy reference file. Exactly one POLICY-REFERENCES element MUST be in a policy reference file. This element MUST contain one RDF element. It MAY optionally contain one or more extensions.
[3] | policies |
= |
`<POLICY-REFERENCES xmlns="http://www.w3.org/2000/P3Pv1" ` rdf-ns-def `>` rdf `</POLICY>` |
[4] | rdf-ns-def |
= |
xmlns `:` rdf-ns-prefix `="http://www.w3.org/1999/02/22-rdf-syntax-ns#"` |
[5] | rdf-ns-prefix |
= |
NCName |
Here NCName is defined in Namespaces in XML [Namespace] |
The RDF element encapsulates RDF expressions in a policy reference file. This element MUST contain one or more POLICY-REF (policy reference) elements.
[6] | rdf |
= |
`<` rdf-ns-prefix `:` `RDF>` policy-ref `</` rdf-ns-prefix `:` `RDF/>` |
A policy reference file may refer to multiple P3P policies, specifying information about each. The POLICY-REF element is an RDF resourece, and describes attributes of a single P3P policy. Elements within the POLICY-REF element give the location of the policy and specify the areas of URI-space that each policy covers.
[7] | policy-ref |
= |
`<POLICY-REF ` rdf-ns-prefix `:` `about="` URI `">` *prefix *exclude *method-element `</POLICY-REF>` |
Here, URI is defined as per RFC 2396 [URI]. |
Each PREFIX or EXCLUDE element specifies one local URI-prefix. They are used to specify the portions of the Web site that is covered by the policy mentioned by the enclosing POLICY-REF element.
When PREFIX (and optionally, EXCLUDE) elements are present in a POLICY-REF element, it means that the policy specified in the space attribute of the POLICY-REF element applies to all the URIs at the requested host corresponding to the local-URI(s) specified by the PREFIXes, but not specified by an EXCLUDE element.
If no PREFIX element is included in a POLICY-REF element, it MUST be implicitly assumed that the policy given by the href applies to the resource which linked to this policy reference file. It is an error for a policy reference file to contain two or more POLICY-REF elements without PREFIX elements. It is legal, but pointless, to supply an EXCLUDE element without any PREFIX elements; in that case, the EXCLUDE elements MUST be ignored by user agents.
A policy reference file can only cover URIs on the same host as the reference file. Therefore, the PREFIX and EXCLUDE elements MUST specify only local URI prefixes; they MUST NOT refer to URIs on other hosts. This requirement does NOT apply to the location of the P3P policy file (the href attribute on the POLICY-REF element).
Note that policy reference files do not support any sort of regular expressions. The only mechanism offered in a POLICY-REF element is to refer to the current document (implicitly, by not giving a PREFIX element), or to use a relative URI-prefix. It is not possible, for example, to state that a certain P3P policy applies to all URIs ending with the extension ".asp", or to any URIs containing more than 3 instances of the letter 't'. Additionally, a policy reference file MUST NOT make statements about URIs on hosts other than the one where the policy reference file is located.
Further, notice that PREFIX and EXCLUDE matching is done as a simple string prefix matching. As a result, a missing "/" at the end of a directory prefix might lead to unexpected results. For example, the element <EXCLUDE>/images/logos</EXCLUDE> (notice the missing '/' at the end of the href) will not only exclude all resources in the /images/logos/ subdirectory but also, for example, a file with the relative URI /images/logoschool.jpg!
[8] | prefix |
= |
`<PREFIX>` URI`</PREFIX>` |
[9] | exclude |
= |
`<EXCLUDE>` URI `</EXCLUDE>` |
Here, URI is defined as per RFC 2396 [URI]. |
By default, a policy reference applies to the stated URIs regardless of the method used to access the resource. However, a Web site may wish to define different P3P policies depending on the method to be applied to a resource. For example, a site may wish to collect more data from users when they are performing PUT or DELETE methods than when performing GET methods.
The METHOD element in a policy reference file is used to state that the enclosing policy reference only applies when the specified methods are used to access the referenced resources. The METHOD element may be repeated to indicate multiple applicable methods. If the METHOD element is not present in a POLICY-REF element, then that POLICY-REF element covers the resources indicated regardless of the method used to access them.
So, to state that /P3P/Policy1.p3p applies to all documents in the subtree /docs/ for GET and HEAD methods, while /P3P/Policy2.p3p applies for PUT and DELETE methods, the following policy reference would be written:
<POLICY-REFERENCES xmlns="http://www.w3.org/2000/P3Pv1" xmlns:web="http://www.w3.org/1999/02/22-rdf-syntax-ns#" > <web:RDF> <POLICY-REF web:about="/P3P/Policy1.p3p"> <PREFIX>/docs/</PREFIX> <METHOD>GET</METHOD> <METHOD>HEAD</METHOD> </POLICY-REF> <POLICY-REF web:about="/P3P/Policy2.p3p"> <PREFIX>/docs/</PREFIX> <METHOD>PUT</METHOD> <METHOD>DELETE</METHOD> </POLICY-REF> </web:RDF> </POLICY-REFERENCES>
[10] | method-element |
= |
`<METHOD>` Methods `</METHOD>` |
Here, Method is defined in the section 5.1.1 of [HTTP1.1]. |
Policy reference files may refer directly or indirectly to policies. A direct reference to a policy is a policy URI which, when fetched, returns the XML document which makes up that policy. An indirect reference to a policy is a policy URI which, when fetched, returns a new policy URI. The new policy URI returned by an indirect reference MAY, itself, be an indirect reference, though this is discouraged for performance reasons.
Direct and indirect references are recognized by the HTTP return code given by the server when fetched. When the URI of a direct policy reference is fetched, the server SHOULD return a 200-class HTTP return code or a 301 (Moved Permanently) HTTP return code (or an error code, if appropriate). It MUST NOT give a 302 (Found), return code as a response. When the URI of an indirect policy reference is fetched, a 302 return code MUST be given, unless an error (400- or 500-class) return code is appropriate. When a 302 return code is returned, it MUST include a Location response header giving the actual policy URI.
Services MAY choose to use direct or indirect policy references as appropriate (so long as the requirements under Immutability of Policies are respected). A direct policy reference will result in the best performance for user agents that are processing those policies. Due to the immutability rule, if a user agent receives a direct policy reference to a URI that it has already fetched, then no additional network activity is required in order to process that policy. This results in quicker response time for the user agent.
Indirect policy references require at least one additional network round trip to locate the actual policy. This results in reduced performance for the user agent. However, it allows for more flexible policy deployment for certain organizations. An example will assist in illustrating:
Imagine that an imaginary company, CatalogExample, is establishing a worldwide Web presence. Its default Web site, www.catalog.example.com, provides links to a number of country-specific sites. For purposes of this example, assume that CatalogExample starts by deploying four localized sites: usa.catalog.example.com (USA), www.catalog.example.co.uk (United Kingdom), www.catalog.example.com.ru (Russia), and www.catalog.example.com.jp (Japan). Let us assume that each of these sites has their content developed locally. This allows the sites to be better tailored to their local audiences.
However, the CatalogExample company has decided that it will have a single privacy policy that will apply to all of their sites around the world. They could do this by deploying that privacy policy on their master Web site (www.catalog.example.com), and having pages on their localized servers reference that policy. When CatalogExample company wishes to update their privacy policy, then by the Immutability of Policies rule, they must place that policy at a new URI. Then the policy references on all of their sites must be changed. This will probably involve work by several Webmasters and Webmistresses in various parts of the globe. The problem becomes far worse when CatalogExample expands to more of the world, and has perhaps 20 or 50 localized Web sites.
Indirect policy references are intended as a solution to this management problem. Each of the local CatalogExample sites can contain a policy URI pointing to the main CatalogExample server. Fetching this URI returns a reference to the currently-applicable privacy policy. For example, imagine that the CatalogExample company wants to gather customers' e-mail addresses to send them a note listing weekly specials. Each of the local servers could use an indirect policy URI of http://www.catalog.example.com/privacy/P3P/policy-weeklyspecial. Resolving this URI would then return a link to the actual privacy policy; perhaps this might be http://www.catalog.example.com/privacy/P3P/policy-weeklyspecial-3.xml. Now, when the corporation wishes to update the privacy policy that applies to the weekly special registration form, they need only update in a single location, regardless of how many servers reference that policy.
In general, services should use direct policy references whenever it is feasible. Indirect policy references are expected to be used only by organizations with large and diverse Web presences.
Note that services SHOULD make indirect policy references only across URIs that are under the same organizational control, to help insure the accuracy of the policy statement. However, there is no technical means to enforce this requirement. Indirect policy references MAY be to URIs on other hosts or even in other domains, depending on the structure of an organization's Web presence.
When a user agent receives a policy reference, there is no way for it to tell if it is a direct or indirect policy reference. To process the policy properly, the user agent MUST fetch the URI specified in the policy reference. If that reference returns a 302 (Found) return code, then the user agent MUST fetch the URI given in the Location header to locate the actual policy in order to process the policy. Note that once a policy is fetched by direct reference, it need not be fetchd again (as long as the user agent records the relevant information). However an indirect reference requires rechecking to make sure it has not changed (unless an Expires or Cache-Control HTTP header indicates that it has not changed).
HTML pages often contain links to other resources that are directly embedded in the page, such as images, sounds, layers or frames. Thus, in order to render the page, the user agents needs to make additional requests which might or might not be covered by the policy in effect for the page that is currently laid out.
As described in section 2.1.1 Mechanisms, the preferred method for such situations is to declare all policies in effect in a single policy reference file. If the policy reference file does not contain a POLICY-REF covering a given URI that links to embedded content (e.g., the src-attribute of an IMG-tag) it SHOULD initially assume that NO policy is in effect for the given resource. However, user agents MAY wish to issue a HEAD request to such a URI before actually requesting the resource, in order to find the policy in effect.
A very important rule of policy references is that of non-ambiguity: For each resource at a Website there MUST be at most one policy active at any given time. Thus two policy reference files on a given site MUST NOT declare two or more different policy URIs for the same resource.
While the need to check for such ambiguities within a single policy reference file is obvious, user agents MAY also track policy declarations across an entire website, in order to detect ambiguities in policy declarations.
See also the section on Immutability of Policies for a discussion of non-ambiguity over time (immutability).
Multiple language versions (translations) of the same policy can be offered
by the server using the HTTP "Content-Language
" header to properly
indicate that a particular language has been used for the policy. This is
useful so that human-readable fields such as entity and consequence can be
presented in multiple languages. The same mechanism can also be used to offer
multiple language versions for data schemas. Whenever
Content-Language
is used to distinguish policies at the same
URI that are offered in multiple languages, the policies MUST have the same
meaning in each language.
An essential requirement on policies is the so-called immutability of policies: with one exception, policies that are directly referenced at a certain URI cannot be changed. This way, the URI of a policy acts like a unique identifier for the policy, and any new policy must therefore use a new different URI. The only exception to this general principle is when multiple language versions (translations) of the same policy are offered by the server using the HTTP "Content-Language" tag.
P3P clients MAY check for immutability of policies, by comparing a cached version of a policy (and its Content-Language if present) with the corresponding freshly retrieved policy (and Content-Language if present). If a user agent discovers that the two policies are different but retain the same URI, then it MUST treat the resource covered by the changed policy as if it has no P3P policy, UNLESS they have two different values of Content-Language.
Note that immutability of policies only holds to policies that are directly referenced: the URI returned when an indirect policy reference is fetched MAY change over time; after all, this is the purpose behind indirect policy references. Indirect policy references MUST NOT be changed into direct policy references; if this is desired, a new policy-URI MUST be used.
Every P3P-enabled user agent and service SHOULD ensure that all the relevant communications that take place as part of fetching a P3P policy are part of a special "safe zone" in which minimal data collection takes place and any data that is collected is used only in non-identifiable ways.
To support this safe zone, P3P user agents SHOULD suppress the transmission of data unnecessary for the purpose of finding a site's policy until the policy has been fetched. Thus user agents SHOULD NOT send the HTTP Referer header, cookies, or user agent information while requesting a P3P policy.
In addition, P3P user agents MAY issue a HEAD request to a site in order to learn the location of the relevant policy before making other requests. This is a useful way to obtain a site's policy without making a request that could result in the transmission of data. However, since it might be possible for sites to detect the user's identify from the Accept-Language header (compare the [HTTP1.1] Specification, section 15.1.4 Privacy Issues Connected to Accept Headers), HEAD requests may be issued without an Accept-Language header in order to get the machine-readable part of the policy, and only if that is reasonably satisfactory, the policy in the appropriate language is fetched if necessary.
Servers SHOULD NOT require the receipt of an HTTP Referer header, cookies, user agent information, or other information unnecessary for responding to the request in order to serve a policy file. In addition, servers SHOULD NOT use in an identifiable way any information collected while serving a policy file or responding to a HEAD request.
Servers MAY return a PolicyRef header in the response headers when a P3P policy is requested. However, it is important to note that the PolicyRef header MUST be ignored, and that the "safe zone" requirements described in this section apply instead. Returning a PolicyRef header in such cases is permitted in consideration of the fact that administrators may find it easier to apply a P3P policy to all documents on a server, and that requiring policies to be served without a PolicyRef header may result in extra work for site administrators.
Note that the safezone requirements do not say that sites cannot keep identifiable information -- only that they SHOULD NOT use in an identifiable way any information collected while serving a policy file. Tracking down the source of a denial of service attack, for example, would be a legitimate reason to use this information and ignore the SHOULD.
There are two important further requirements on the server side:
P3P policies and references to P3P policies SHOULD NOT, in themselves, contain any sensitive information. This means that there are no additional security requirements for transporting a reference to a P3P policy beyond the requirements of the document it is associated with; so, if an HTML document would normally be served over a non-encrypted session, then the P3P protocol would not require nor recommend that the document be served over an encrypted session when a reference to a P3P policy is included with that document.
Section 3.1 begins with an example of an English language privacy policy and a corresponding P3P policy in Section 3.1. P3P policies include general assertions that apply to the entire policy as well as specific assertions -- called statements -- that apply only to the handling of particular types of data referred to by data references. Section 3.2 describes the policy element and policy-level assertions. Section 3.3 describes statements and data references.
In the sections that follow a number of XML elements are introduced. Each element is given in <> brackets, followed by a list of valid attributes. All listed attributes are optional, except when tagged as mandatory. Note that many XML elements are shown in the BNF with separate beginning and ending tags, to allow optional elements inside them. If no elements are included, then, following standard XML rules, a self-closing element may be used instead.
The following is an example of an English-language privacy policy to be encoded as a P3P policy.
CatalogExample, of 50 Whitfield Road, Bethesda, MD 20814, USA, makes the following statement for the Web page at http://www.catalog.example.com/catalog/. We have a privacy seal from PrivacySeal.example.org. Our privacy policy is posted at http://www.catalog.example.com/PrivacyPractice.html. We do not provide access capabilities to information we may have about you.We use cookies and collect your gender, information about your clothing preferences, and (optionally) your home address to customize our entry catalog pages and for our own research and product development. We retain this information indefinitely.
We also maintain server logs that include information about visits to the http://www.catalog.example.com/catalog/ page, and the types of browsers our visitors use. We use this information in order to maintain and improve our web site. We retain this information indefinitely.
The following is a more formal description, using the P3P element and attribute names:
Entity: CatalogExample, 50 Whitfield Road, Bethesda, MD 20814, USADisputes:
resolution type: independent
service: http://www.privacyseal.example.org
description:PrivacySeal.example.orgDisclosure:
Disclosure URI: http://www.catalog.example.com/PrivacyPractice.html
Access to Identifiable Information: none
We may collect:
dynamic.cookies (category = state)
user.gender
dynamic.miscdata (category = pref)
user.home. (optional)
For purpose: Customization of the site to individuals, research and development
Retention: Indefinitely
Recipients: Only ourselves and our agents
Consequence: A site with clothes you would appreciateWe collect:
dynamic.clickstream.server
dynamic.http.useragent
For purpose: Web site and system administration, research and development
Retention: Indefinitely
Recipients: Only ourselves and our agents
The following piece of [XML] captures the information as expressed above. P3P policies are statements that are properly expressed as well-formed XML. The policy syntax will be explained in more detail in the sections that follow.
Example 3.1
<POLICY xmlns="http://www.w3.org/2000/P3Pv1" discuri="http://www.catalog.example.com/PrivacyPractice.html"> <ENTITY> <DATA-GROUP> <DATA ref="#business.name">CatalogExample</DATA> <DATA ref="#business.contact-info.postal.street.line1">50 Whitfield Road</DATA> <DATA ref="#business.contact-info.postal.city">Bethesda</DATA> <DATA ref="#business.contact-info.postal.stateprov">MD</DATA> <DATA ref="#business.contact-info.postal.postalcode">20814</DATA> <DATA ref="#business.contact-info.postal.countrycode">US</DATA> </DATA-GROUP> </ENTITY> <DISPUTES-GROUP> <DISPUTES resolution-type="independent" service="http://www.PrivacySeal.example.org" short-description="PrivacySeal.example.org"> <IMG src="http://www.PrivacySeal.example.org/Logo.gif"/> </DISPUTES> </DISPUTES-GROUP> <ACCESS><none/></ACCESS> <STATEMENT> <CONSEQUENCE>A site with clothes you would appreciate</CONSEQUENCE> <RECIPIENT><ours/></RECIPIENT> <PURPOSE><custom/><develop/></PURPOSE> <RETENTION><indefinitely/></RETENTION> <DATA-GROUP> <DATA ref="#dynamic.cookies"><CATEGORIES><state/></CATEGORIES></DATA> <DATA ref="#dynamic.miscdata"><CATEGORIES><preference/></CATEGORIES></DATA> <DATA ref="#user.gender"/> <DATA ref="#user.home." optional="yes"/> </DATA-GROUP> </STATEMENT> <STATEMENT> <RECIPIENT><ours/></RECIPIENT> <PURPOSE><admin/><develop/></PURPOSE> <RETENTION><indefinitely/></RETENTION> <DATA-GROUP> <DATA ref="#dynamic.clickstream.server"/> <DATA ref="#dynamic.http.useragent"/> </DATA-GROUP> </STATEMENT> </POLICY>
This section defines the syntax and semantics of P3P policies. All policies MUST be encoded using [UTF-8]. P3P servers MUST encode their policies using this syntax. P3P user agents MUST be able to parse this syntax.
The POLICY element contains a complete P3P policy. Each P3P policy MUST contain exactly one POLICY element. The policy element MUST contain an ENTITY element that identifies the legal entity making the representation of the privacy practices contained in the policy. In addition, the policy element MUST contain an ACCESS element, at least one STATEMENT element, and optionally a DISPUTES-GROUP element and one or more extensions.
[11] | policy |
= |
`<POLICY xmlns="http://www.w3.org/2000/P3Pv1" discuri=` quoted-URI `>` *extension entity access [disputes-group] 1*statement-block *extension `</POLICY>` |
[12] | quoted-URI |
= |
`"` URI `"` |
Here, URI is defined as per RFC 2396 [URI]. |
The ENTITY element gives a precise description of the legal entity making the representation of the privacy practices.
The ENTITY element contains a description of the legal entity consisting of DATA elements referencing (all or part of) the fields of the business dataset: it MUST contain both the legal entity's name as well as contact information such as postal address, telephone number, email address, or other information that individuals may use to contact the entity about their privacy policy. Note that some laws and codes of conduct require entities to include a postal address or other specific information in their contact information.
[13] | entity |
= |
"<ENTITY>" *extension entitydescription *extension "</ENTITY>" |
[14] | entitydescription |
= |
<DATA-GROUP> `<DATA ref="#business.name"/>` PCDATA </DATA> *(`<DATA ref="#business." string `/>` PCDATA </DATA>) </DATA-GROUP> |
Here, string is defined as a [UTF-8] string (with " and & escaped) among the values that are allowed by the business dataset. PCDATA is defined as in [XML]. |
The ACCESS element indicates whether the site provides access to various kinds of information.
Note that service providers may also wish to provide capabilities to access information collected through means other than the Web at the discuri. However, the scope of P3P statements are limited to data collected through HTTP or other Web transport protocols. Also, if access is provided through the Web, use of strong authentication and security mechanisms for such access is recommended; however, security issues are outside the scope of this document.
The ACCESS element must contain one among the following elements:
[15] | access |
= |
"<ACCESS>" access_disclosure *extension </ACCESS> |
[16] | access-disclosure |
= |
"<nonident/>" | ; Identifiable Data is Not Used "<contact/>" | ; Identifiable Contact Information "<other_ident/>" | ; Other Identifiable Information |
A policy SHOULD contain a DISPUTES-GROUP element, which contains one or more DISPUTES elements. These elements describe dispute resolution procedures that may be followed for disputes about a services' privacy practices. Each DISPUTES element can optionally contain a LONG-DESCRIPTION and IMG tag, containing respectively a long human-readable description and an image logo, and a a REMEDIES element that specifies the possible remedies in case a policy breach occurs.
The DISPUTES element can contain a LONG-DESCRIPTION element, where a human readable description is present: this should contain the name of the appropriate legal forum, applicable law, or third party organization; or contact information for customer service if not already provided at the service URI.
[17] | disputes-group |
= |
"<DISPUTES-GROUP>" *extension 1*dispute *extension "</DISPUTES-GROUP>" |
[18] | dispute |
= |
"<DISPUTES" " resolution-type=" '"'("service"|"independent"|"court"|"law")'"' " service=" quoted-URI [" verification=" quoted-string] [" short-description=" quoted-string] "/>" [longdescription] [image] [remedies] *extension "</DISPUTES>" |
[19] | longdescription |
= |
<LONG-DESCRIPTION> PCDATA </LONG-DESCRIPTION> |
[20] | image |
= |
"<IMG src=" quoted-URI [" width=" `"` number `"`] [" height=" `"` number `"`] [" alt=" quoted-string] "/>" |
[21] | quoted-string |
= |
`"` string `"` |
Here, string is defined as a [UTF-8] string (with " and & escaped), and PCDATA is defined as in [XML]. |
Note that there can be multiple assurance services, specified via multiple occurrences of DISPUTES within the DISPUTES-GROUP element. These fields are expected to be used in a number of ways, from representing that one's privacy practices are self assured, audited by a third party, or under the jurisdiction of a regulatory authority.
Each DISPUTES element SHOULD contain a REMEDIES element
that specifies the possible remedies in case a policy breach occurs.
The REMEDIES element must contain one or more among the following:
[22] | remedies |
= |
"<REMEDIES>" 1*remedy *extension "</REMEDIES>" |
[23] | remedy |
= |
"<correct/>" | "<money/>" | "<law/>" |
Statements describe data practices that are applied to particular types of data.
The STATEMENT element is a container that groups together a PURPOSE element, a RECIPIENT element, a DATA-GROUP element, and optionally a CONSEQUENCE element and one or more extensions. All of the data referenced by the DATA-GROUP is handled according to the disclosures made in the other elements contained by the statement. Thus, sites may group elements that are handled the same way and create a statement for each group. Sites that would prefer to disclose separate purposes and other information for each kind of data they collect can do so by creating a separate statement for each data element.
[24] | statement-block |
= |
"<STATEMENT>" *extension [consequence] purpose recipient retention 1*data-group *extension "</STATEMENT>" |
To simplify practice declaration, service providers may aggregate any of the disclosures (purposes, recipients, and identifiable use) within a statement over data elements. Service providers MUST make such aggregations as an additive operation. For instance, a site that distributes your age to ours (ourselves and our agents), but distributes your zip code to published (unrelated third parties), MAY say they distribute your name and zip code to ours and unrelated. Such a statement appears to distribute more data than actually happens. It is up to the service provider to determine if their disclosure deserves specificity or brevity.
Also, one must always disclose all options that apply. Consider a site with the sole purpose of collecting information for the purposes of contact (Contacting Visitors for Marketing of Services or Products). Even though this is considered to be for the current (Completion and Support of Current Activity) purpose, the site must state both contact and current purposes. Consider a site which distributes information to ours in order to redistribute it to public: the site must state both ours and public recipients.
STATEMENT elements may optionally contain a CONSEQUENCE element that can be shown to a human user to provide further explanation about a site's practices.
[25] | consequence |
= |
"<CONSEQUENCE>" PCDATA "</CONSEQUENCE>" |
Each STATEMENT element MUST contain a PURPOSE element that contains one or more purposes of data collection or uses of data. Sites MUST classify their data practices into one or more of the six specified purposes.
The PURPOSE element must contain one or more among the following:
Each type of purpose can have the following optional attribute:
[26] | yesno |
= |
"yes" | "no" |
[27] | purpose |
= |
"<PURPOSE>" 1*purposevalue *extension "</PURPOSE>" |
[28] | purposevalue |
= |
"<current" [change] "/>" | ; Completion and Support of Current Activity "<admin" [change] "/>" | ; Web Site and System Administration "<develop" [change] "/>" | ; Research and Development "<contact" [change] "/>" | ; Contacting Visitors for Marketing of Services or Products "<customization" [change] "/>" | ; Affirmative Customization "<targeting" [change] "/>" | ; One-time Targeting "<profiling" [change] "/>" | ; Individual Profiling "<other-purpose" [change] " >" PCDATA "</other-purpose>"; Other Uses |
[29] | change |
= |
" change_preferences=" `"` yesno `"` |
Service providers MUST use the above elements to explain the purpose of data collection. Service providers MUST disclose all that apply. If a service provider does not disclose that a data element will be used for a given purpose, that is a representation that data will not be used for that purpose. Service providers that disclose that they use data for "other" purposes MUST provide human readable explanations of those purposes.
Note, that the working group discussed at legnth the possibility of allowing sites to distinguish between purposes they may engage in and purposes they will engage in. The consensus of the working group was that such a distinction is not necessary. However, some members disagreed with this conclusion stating:
Yes, no and may all need to be response options in the vocabulary. If no and may are the only options, then the meaning of may is corrupted to equal yes. May should be an option that reflects its true meaning -- yes or no. If may by default means yes, because yes is not provided as a response option, the consumer will be misled. May should be used to imply that there are a set of rules underlying the term that consumers can refer to understand a privacy policy. If may means yes, the consumer is less likely to investigate via a click-through to the Web site's privacy policy. Potentially, this seemingly simple solution -- no and may -- will be a significant barrier to commerce as consumers are confused by the meaning of the truncated choices of only no and may. Those who argue that providing all three choices -- yes, may, no -- is an attempt by Web sites to mislead consumers are missing the point. In the arena of privacy protection, accuracy in stating a privacy policy is critical to building trust and confidence in the consumer about how information is used. In the interest of software simplicity, limiting consumer preference choices to no and may will do a disservice to the consumer -- and to the Web sites that are trying to communicate accurately with consumers about their policies.
Each STATEMENT element MUST contain a RECIPIENT element that contains one or more recipients of the collected data. Sites MUST classify their recipients into one or more of the six recipients specified.
The RECIPIENT element must contain one or more among the following:
[30] | recipient |
= |
"<RECIPIENT>" 1*recipientvalue *extension "</RECIPIENT>" |
[31] | recipientvalue |
= |
"<ours/>" | ; only ourselves and our agents "<same/>" | ; legal entities following our practices "<other-recipient/>" | ; legal entities following different practices "<delivery/>" | ; delivery services following different practices "<public/>" | ; public fora "<unrelated/>" ; unrelated third parties |
Service providers MUST disclose all the recipients that apply. Note that in some cases the above set of recipients may not completely describe all the recipients of data. For example, the issue of transaction facilitators, such as shipping or payment processors, who are necessary for the completion and support of the activity but may follow different practices was problematic. Currently, only delivery services can be explicitly represented in a policy. Other such transaction facilitators should be represented in whichever category most accurately reflects their practices with respect to the original service provider. The working group decided to include a special element for delivery services, but not for payment processors (such as banks or credit card companies) for the following reasons: Financial institutions will typically have separate agreements with their customers regarding the use of their financial data, while delivery recipients typically do not have an opportunity to review a delivery service's privacy policy.
Note that the <delivery/>
element SHOULD NOT be used for
delivery services that agree to use data only on behalf of the service provider
for completion of the delivery.
Each STATEMENT element MUST contain a RETENTION element that indicates the kind of retention policy that applies to the data referenced in that statement.
The RETENTION element must contain one of the following:
[32] | retention |
= |
"<RETENTION>" retentionvalue *extension "</RETENTION>" |
[33] | retentionvalue |
= |
"<no-retention/>" | ; not retained "<stated-purpose/>" | ; for the stated purpose "<legal-requirement/>" | ; stated purpose by law "<indefinitely/>" | ; indeterminated period of time "<business-practices/>" ; by business practices |
Each STATEMENT element MUST contain at least one DATA-GROUP element that contains one or more DATA elements. DATA elements are used to describe the type of data that a site collects.
The following six attributes are only used when a new (not defined in the P3P [Base Data Schema]) data element or set is referenced:
DATA elements can contain the actual data (like we have already seen in the case of the ENTITY element), and can contain related category information. When used in a dataschema, they can also contain a long description of the data element/set, using the LONG-DESCRIPTION element.
[34] | data-group |
= |
"<DATA-GROUP" |
[35] | data-reference |
= |
"<DATA" ( " ref=`" URI-reference "`" [" optional=" yesno] ">" [categories] ; the categories of the data element. [PCDATA] ; the eventual value of the data element ) | ( " name=" quoted-string [" typeref=`" URI-reference "`"] [" template=" yesno] [" short-description=" quoted-string] [" size=" `"` number `"`] ) ; default is 0 (unlimited size) ">" [categories] ; the categories of the data element. [longdescription] ; the eventual long description of the data element ) "</DATA>" |
Here, URI-reference is defined as in [URI]. |
For example, to reference the user's home address city, all the elements of the data set user.business. and (optionally) all the elements of the data set user.home.phone., the service would send the following references inside a P3P policy:
<DATA-GROUP> <DATA ref="#user.home.city"/> <DATA ref="#user.home.phone." optional="yes"/> <DATA ref="#user.business."/> </DATA-GROUP>
When the actual value of the data is known, it can be expressed inside the DATA element, like eventual extensions. For example, as seen in the example policy:
<ENTITY> <DATA ref="#business.name">CatalogExample</DATA> <DATA ref="#business.contact-info.postal.street.line1">50 Whitfield Road</DATA> <DATA ref="#business.contact-info.postal.city">Bethesda</DATA> <DATA ref="#business.contact-info.postal.stateprov">MD</DATA> <DATA ref="#business.contact-info.postal.postalcode">20814</DATA> <DATA ref="#business.contact-info.postal.countrycode">US</DATA> </ENTITY>
Categories are elements inside data elements that provide hints to users and user agents as to the intended uses of the data. Categories are vital to making P3P user agents easier to implement and use; they allow users to express more generalized preferences and rules over the exchange of their data. Categories are often included when defining a new element or when referring to data that the user is prompted to type in (as opposed to data stored in the user data repository).
In the current version of P3P, the following elements are used to denote data categories:
[36] | categories |
= |
"<CATEGORIES>" 1*category "</CATEGORIES>" |
[37] | category |
= |
"<physical/>" | ; Physical Contact Information "<online/>" | ; Online Contact Information "<uniqueid/>" | ; Unique Identifiers "<purchase/>" | ; Purchase Information "<financial/>" | ; Financial Information "<computer/>" | ; Computer Information "<navigation/>" | ; Navigation and Click-stream Data "<interactive/>" | ; Interactive Data "<demographic/>" | ; Demographic and Socioeconomic Data "<content/>" | ; Content "<state/>" | ; State Management Mechanisms "<political/>" | ; Political Information "<health/>" | ; Health Information "<preference/>" | ; Preference Data "<other/>" ; Other |
The Computer, Navigation, Interactive and Content categories can be distinguished as follows. The Computer category includes information about the user's computer including IP address and software configuration. Navigation data describes actual user behavior related to browsing. When an IP address is stored in a log file with information related to browsing activity, both the Computer category and the Navigation category should be used. Interactive Data is data actively solicited to provide some useful service at a site beyond browsing. Content is information exchanged on a site for the purposes of communication.
The Other category should be used only when data is requested that does not fit into any other category.
P3P uses categories to give users and user agents additional hints as to what type of information is requested from a service. While most data in the Base Data Schema is in a known category (or a set of known categories), some data elements can be in a number of different categories, depending on the situation. The former are called fixed-category data elements (or "fixed data elements" for short), the latter variable-category data elements ("variable data elements"). Both types of elements are briefly described in the two sections below.
Most of the elements in the base data schema are so called "fixed" data elements: they belong to one or at most two category classes. By assigning a category invariably to elements in the base data schema, services and users are able to refer to entire groups of elements simply by referencing the corresponding category. For example, using [APPEL], the privacy preferences exchange language, users can write rules that prevent their user agent from giving out any data element in a certain category.
When creating data schemas for fixed data elements, schema creators have to explicitly enumerate the categories that these element belong to. For example:
<DATA name="postal.street.line1" typeref="#text"
short-description="Street Address, Line 1" template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
If an element belongs to multiple categories, multiple elements referencing the appropriate categories can be used. For example, the following piece of XML can be used to declare that the data elements in user.name. have both category "physical" and "demographic":
<DATA name="user.name." typeref="#personname."
short-description="User's Name" template="yes">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
Please note that the category classes of fixed data elements can not be overridden, for example by writing rules or policies that assign a different category to a known fixed base data element. User Agents MUST ignore such categories and instead use the original category (or set of categories) listed in the schema definition. User Agents MAY preferably alert the user that a fixed data element is used together with a non-standard category class.
Not all data elements in the base data schema belong to a pre-determined category class. Some elements can contain information from a range of categories, depending on a particular situation. Such elements are called variable-category data elements (or "variable data element" for short). Although most variable data elements in the P3P Base Data Schema are combined in the dynamic. element set, they can appear in any data set, even mixed with fixed-category data elements.
When creating a schema definition for such elements, schema authors MUST NOT list an explicit category attribute, otherwise the element becomes a fixed data element. For example when specifying the "Year" data type, which can take various categories depending on the situation (e.g. when used for a credit card expiration date vs. for a birth date), the following schema definition can be used:
<DATA name="date.ymd.year" typeref="#number" size="6"
short-description="Year" template="yes"/> <!-- Variable Data Element -->
This allows new schema extensions that reference such variable-category data types to assign a specific category to derived elements, depending on their usage in that extension. For example, an E-commerce schema extension could thus define a credit card expiration date as follows:
<DATA name="Card.ExpDate." typeref="#date.ymd."
short-description="Card Expiration Date" template="yes">
<CATEGORIES><financial/></CATEGORIES>
</DATA>
Under these conditions, the variable data type date. is assigned a fixed category Financial Account Identifiers when being used for specifying a credit card expiration date.
Note that while user preferences can list such variable data elements without any additional category information (effectively expressing preferences over any usage of this element), services MUST always explicitly specify the categories that apply to the usage of a variable data element in their particular policy. This information has to appear as a category element in the corresponding DATA element listed in the policy, for example as in:
<POLICY ... >
...
<DATA ref="#dynamic.cookies"><CATEGORIES><uniqueid/></CATEGORIES></DATA>
...
</POLICY>
where a service declares that cookies are used for identifying the user at this site (i.e. category Unique Identifiers).
If a service wants to declare a data element that is in multiple categories, it simply declares the corresponding categories (as shown in section 3.4.1 above):
<POLICY ... >
...
<DATA ref="#dynamic.cookies"><CATEGORIES><uniqueid/><preference/></CATEGORIES></DATA>
...
</POLICY>
With the above declaration a service announces that it uses cookies both for identifying the user at this site and for storing user preference data. Note that for the purpose of P3P there is no difference whether this information is stored in two separate cookies or in a single one.
P3P provides a flexible and powerful mechanism to extend its syntax and semantics using one element: EXTENSION. This element is used to indicate portions of the policy which belong to an extension. The meaning of the data within the EXTENSION element is defined by the extension itself.
[38] | extension |
= |
"<EXTENSION" [" optional=" '"' yesno '"'] ">" PCDATA "</EXTENSION>" |
For example, if www.catalog.example.com would like to add to P3P a feature to indicate that a certain set of data elements were only to be collected from users living in the United States, Canada, or Mexico, it could add a mandatory extension like this:
<DATA-GROUP> ... <EXTENSION> <COLLECTION-GEOGRAPHY type="include" xmlns="http://www.catalog.example.com/P3P/region"> <USA/><Canada/><Mexico/> </COLLECTION-GEOGRAPHY> </EXTENSION> </DATA-GROUP>
On the other hand, if www.catalog.example.com would like to add an extension stating what country the server is in, an optional extension might be more appropriate, such as the following:
<POLICY> <EXTENSION optional="yes"> <ORIGIN xmlns="http://www.catalog.example.com/P3P/origin" country="USA"/> </EXTENSION> ... </POLICY>
The xmlns attribute is significant since it specifies the namespace for interpreting the names of elements and attributes used in the extension. Note that, as specified in [XML-Name], the namespace URI is just intended to be a unique identifier for the XML entities used by the extension. Nevertheless, service providers MAY provide a page with a description of the extension at the corresponding URI.
P3P has the ability to define data schemas to provide a common way for services and user agents to refer to data elements. A data schema describes specific data elements, which may be grouped into hierarchical data sets.
Services may declare and use data elements by creating a data schema and referencing it in a policy using the dataschema attribute. P3P comes with a standard data schema, the P3P Base Data Schema, that besides defining a wide variety of commonly used data elements, also provides basic data types, which can be conveniently reused by other new schemas.
The format of a data schema is:
<DATASCHEMA xmlns="http://www.w3.org/2000/P3Pv1"> <DATA ... /> ... <DATA ... /> </DATASCHEMA>
[39] | dataschema |
= |
`<DATASCHEMA xmlns="http://www.w3.org/2000/P3Pv1">` *(data-reference|extension) "</DATASCHEMA>" |
The <DATASCHEMA> element contains references to the new data elements. Such references can be made using the <DATA> tag and the following attributes: name, type, typeschema (which may be omitted if the typeschema is the Base Data Schema), template, category, short, long, size.
For every data element, every missing attribute is presumed to be present with an empty string as defaut value. In the case of the typeschema, the empty string value has the special meaning that the type schema coincides with the namespace of the corresponding DATA element.
For example, suppose the company HyperSpeedExample wants to build the following data schema:
vehicle.model (of primitive type text)
vehicle.color (of primitive type text)
vehicle.built.year (of primitive type number)
vehicle.built.where. (of basic type postal.)
vehicle.price (of primitive type number)
car.model (of primitive type text)
car.color (of primitive type text)
car.built.year (of primitive type number)
car.built.where. (of basic type postal.)
car.price (of primitive type number)
Then, it could place the following code at http://www.HyperSpeed.example.com/models-schema
<DATASCHEMA xmlns="http://www.w3.org/2000/P3Pv1"> <DATA name="vehicle.model" typeref="http://www.w3.org/TR/P3P/base#text" short-description="Model" size="63"> <CATEGORIES><preference/></CATEGORIES></DATA> <DATA name="vehicle.color" typeref="http://www.w3.org/TR/P3P/base#text" short-description="Color" size="63"> <CATEGORIES><preference/></CATEGORIES></DATA> <DATA name="vehicle.built.year" typeref="http://www.w3.org/TR/P3P/base#number" short-description="Construction Year" size="63"> <CATEGORIES><preference/></CATEGORIES></DATA> <DATA name="vehicle.built.where." typeref="http://www.w3.org/TR/P3P/base#postal." short-description="Construction Place" size="63"> <CATEGORIES><preference/></CATEGORIES></DATA> <DATA name="car." typeref="#vehicle."/> </DATASCHEMA>
Note that every time a data schema is created, it can be implicitly used as a type, just like the vehicle. case above.
Continuing with the example, in order to reference a car model and construction year the service could send the following references inside a P3P policy:
<DATA-GROUP> <DATA ref="http://www.HyperSpeed.example.com/models-schema#car.model"/> <DATA ref="http://www.HyperSpeed.example.com/models-schema#car.built.year"/> </DATA-GROUP>
or, using the base attribute, even a more compact
<DATA-GROUP
base="http://www.HyperSpeed.example.com/models-schema">
<DATA ref="#car.model"/>
<DATA ref="#car.built.year"/>
</DATA-GROUP>
In order to provide multilingual support for data schema files, a server can supply the right alternative based on the HTTP Accept-Language header.
Data elements can be classified according to whether or not they are in some fixed category (using the category attribute). Schema designers can use this attribute within their schema definitions to define an invariable category for each element. Once defined, this value cannot be changed when referencing such elements from within user preferences, P3P policies, or other schema definitions. However, if left undefined, this attribute MUST be explicitly listed in each P3P policy referencing such elements. Users can have different preferences depending on different attribute-values for the same element. And in the case of undefined attributes within data types, other schema definitions can explicitly set categories in derived elements (otherwise the original definition overrides any value in the derived schema).
Note that the data element names specified in the base data schema or in extension data schemas may be used for purposes other than P3P policies. For example, web sites may use these names to label HTML form fields. By referring to data the same way in P3P policies and forms, automated form-filling tools can be better integrated with P3P user agents. When P3P data element names are used as HTML form field names, underscores ("_") MUST be used in place of dot notation (e.g. user.name.given must be referenced as user_name_given). This allows interoperability with client-side javascript which also uses the dot notation to access form field names and values.
Analogously to P3P policies, an essential requirement on dataschemas is the immutability of dataschemas: with one exception, dataschemas that can be fetched at a certain URI cannot be changed. This way, the URI of a policy acts like a unique identifier for the dataschema, and usually any new dataschema must therefore use a new different URI. The only exception to this general principle is when multiple language versions (translations) of the same dataschema are offered by the server using the HTTP "Content-Language" tag to properly indicate that a particular language encoding has been used for the dataschema. P3P clients MAY check for immutability of dataschemas, by comparing a cached version of a dataschema (and its Content-Language if present) with the corresponding freshly retrieved dataschema (and Content-Language if present). If a user agent discovers that the two dataschemas are different but retain the same URI, then it MUST treat the resource referencing the changed dataschema as if it has no P3P policy, UNLESS the dataschemas have two different values of their Content-Language.
P3P schemas may refer to the following primitive data element types:
Primitive DataType | Definition |
text | [UTF-8] |
gender | "M" or "F". |
boolean | "false" or "true". |
binary | Base64 per RFC-2045. [MIME] |
number | text composed with the digits "0", "1", "2", "3", "4", "5", "6", "7", "8", "9". |
countrycode | two letters country code as per [ISO3166] |
uri | [URI] |
The Basic Data Types are structured types used by the P3P Base Data Schema (and, possibly, reused by other different data schemas). All P3P-compliant user agent implementations MUST be aware of the Basic Data Types. Each table below specifies the elements of a basic data type, the categories associated, their types, and the display names shown to users. More than one category may be associated with a fixed data element. However, each base data element is assigned to only one category whenever possible. Data schema designers are recommended to do the same.
The date. type is a structured type that specifies a date. Since date information can be used in different ways, depending on the context, all date. information is tagged as being of "variable" category. Schema definitions have to explicitly set the corresponding category in the element referencing this data type. For example, soliciting the birthday of a user might be "Demographic and Socioeconomic Data", while the expiration date of a credit card belongs to the "Financial Account Identifiers" category.
date. | Category | Type | Short display name |
ymd.year | (variable-category) | number | Year |
ymd.month | (variable-category) | number | Month |
ymd.day | (variable-category) | number | Day |
hms.hour | (variable-category) | number | Hour |
hms.minute | (variable-category) | number | Minute |
hms.second | (variable-category) | number | Second |
fractionsecond | (variable-category) | number | Fraction of Second |
timezone | (variable-category) | text | Time Zone |
All the fields in the date. type must be in the same format as those in the most informative profile of the time standard [ISO8601]. Note that "date.ymd." and "date.hms." can be used to fast reference the year/month/day and hour/minutes/seconds blocks respectively.
The personname. type is a structured type that specifies information about the naming of a person.
personname. | Category | Type | Short display name |
prefix | Demographic and Socioeconomic Data | text | Name Prefix |
given | Physical Contact Information | text | Given Name |
family | Physical Contact Information | text | Family Name |
middle | Physical Contact Information | text | Middle Name |
suffix | Demographic and Socioeconomic Data | text | Name Suffix |
formatted | Physical Contact Information, Demographic and Socioeconomic Data | text | Formatted Name |
nickname | Demographic and Socioeconomic Data | text | Nickname |
The certificate. type is a structured type to specify identity certificates (like, for example, X.509).
certificate. | Category | Type | Short display name |
key | Unique Identifiers | binary | Certificate Key |
format | Unique Identifiers | text | Certificate Format |
The "format" field is an IANA registered public key or authentication certificate format, while the "key" field contains the corresponding certificate key.
The phonenum. type is a structured type that specifies the characteristics of a phone number.
phonenum. | Category | Type | Short display name |
intcode | Physical Contact Information | number | International Phone code |
loccode | Physical Contact Information | number | Local Phone Area code |
number | Physical Contact Information | number | Phone Number |
ext | Physical Contact Information | number | Phone Extension |
comment | Physical Contact Information | text | Phone Optional Comments |
The contact. type is a structured type used to specify contact information. Services can specify precisely which set of data they need, postal, telecommunication, or online address information.
contact. | Category | Type | Short display name |
postal. | Physical Contact Information, Demographic and Socioeconomic Data | postal. | Postal Address Information |
telecom. | Physical Contact Information | telecom. | Telecommunications Information |
online. | Online Contact Information | online. | Online Address Information |
The postal. type is a structured type that specifies a postal mailing address.
postal. | Category | Type | Short display name |
name. | Physical Contact Information, Demographic and Socioeconomic Data | personname. | Name |
street.line1 | Physical Contact Information | text | Street Address 1 |
street.line2 | Physical Contact Information | text | Street Address 2 |
street.line3 | Physical Contact Information | text | Street Address 3 |
city | Physical Contact Information | text | City |
stateprov | Physical Contact Information | text | State or Province |
postalcode | Demographic and Socioeconomic Data | text | Postal code |
countrycode | Demographic and Socioeconomic Data | countrycode | Country code |
country | Demographic and Socioeconomic Data | text | Country Name |
organization | Physical Contact Information, Demographic and Socioeconomic Data | text | Organization Name |
formatted | Demographic and Socioeconomic Data | text | Formatted Postal Address |
Using three distinct fields for the street information allows service providers and user agents to split long addresses into multiple lines during solicitation. However, since all fields share the common street. prefix, this shorthand form can be used to reference all three fields at once.
The "formatted" field is used to specify the formatted text corresponding to the delivery address, as it could for example be printed on a label.
The telecom. type is a structured type that specifies telecommunication information about a person.
telecom. | Category | Type | Short display name |
phone. | Physical Contact Information | phonenum. | Phone number |
fax. | Physical Contact Information | phonenum. | Fax number |
mobile. | Physical Contact Information | phonenum. | Mobile Phone number |
pager. | Physical Contact Information | phonenum. | Pager number |
The online. type is a structured type that specifies online information about a person.
online. | Category | Type | Short display name |
Online Contact Information | text | Email Address | |
uri | Online Contact Information | uri | Home Page Address |
All P3P-compliant user agent implementations MUST be aware of the data elements
in the P3P Base Data Schema. The P3P Base Data Schema includes four element
sets, user.
, thirdparty.
,
business.
and dynamic.
. The
user.
, thirdparty.
and business.
sets include elements that users and/or businesses might provide values for,
while the dynamic.
set includes elements that are dynamically
generated in the course of a user's browsing session. User agents may support
a variety of mechanisms that allow users to provide values for the elements
in the user.
set and store them in a data repository, including
mechanisms that support multiple personae. Users may choose not to provide
values for these data elements.
The formal XML definition of the P3P Base Data Schema is given in Appendix 3. In the following sections, we explain one by one the base data elements and sets. The members of this Working Group expect that in the future, there will be demand for the creation of other data sets and elements. Obvious applications include catalogue, payment, and agent/system attribute schemas. (An extensive set of system elements is provided for example in http://www.w3.org/TR/NOTE-agent-attributes.)
Each table below specifies a set, the elements within the set, the category associated with the element, its type, and the display name shown to users. More than one category may be associated with a fixed data element. However, we have tried to assign each base data element to only one category whenever possible. We recommend that data schema designers do the same.
The user.
data set includes general information about
the user.
user. | Category | Type | Short display name |
name. | Physical Contact Information, Demographic and Socioeconomic Data | personname. | User's Name |
bdate. | Demographic and Socioeconomic Data | date. | User's Birth Date |
cert. | Unique Identifiers | certificate | User's Identity Certificate |
gender | Demographic and Socioeconomic Data | gender | User's Gender |
employer | Demographic and Socioeconomic Data | text | User's Employer |
department | Demographic and Socioeconomic Data | text | Department or division of organization where user is employed |
jobtitle | Demographic and Socioeconomic Data | text | User's Job Title |
home. | Physical Contact Information, Online Contact Information, Demographic and Socioeconomic Data |
contact. | User's Home Contact Information |
business. | Physical Contact Information, Online Contact Information, Demographic and Socioeconomic Data |
contact. | User's Business Contact Information |
Note, that this data set includes elements that are actually sets of data themselves. These sets are defined in the data types subsection of this document. The short display name for an individual element contained within a data set is defined as the concatenation of the short display names that have been defined for the set and the element, separated by commas. For example, the short display name for user.home.postal.postalcode would be "User's Home Contact Information, Postal Address Information, Postal code". User agent implementations may prefer to develop their own short display names rather than using the concatenated names when displaying information for the user.
The thirdparty.
data set allows users and businesses
to provide values for a related third party. This can be useful whenever
third party information needs to be exchanged, for example when ordering
a present online that should be sent to another person, or when providing
information about one's spouse or business partner. Such information could
be stored in the user repository alongside with the user.
data
set. User agents may offer to store multiple such thirdparty.
data sets and allow users to select the appropriate values from a list when
necessary.
The thirdparty.
data set is identical with the
user.
data set. See section 4.4.1 User
Data for details.
The business.
data set features a subset of
user.
data relevant for organizations. In P3P 1.0, this data
set is primarily used for declaring the policy entity, though it should also
be applicable to business-to-business interactions.
business. | Category | Type | Short display name |
name | Demographic and Socioeconomic Data | text | Organization Name |
department | Demographic and Socioeconomic Data | text | Department or division of organization |
cert. | Unique Identifiers | certificate | Organizantion Identity Certificate |
contact-info. | Physical Contact Information, Online Contact Information, Demographic and Socioeconomic Data |
contact. | Contact Information for the Organization |
In some cases, there is a need to specify data elements that do not have fixed values that a user might type in or store in a repository. In the P3P Base Data Schema, all such elements are grouped under the dynamic. data set. Sites may refer to the types of data they collect using the dynamic data set only, rather than enumerating all of the specific data elements.
dynamic. | Category | Type | Short display name |
clickstream.client | Navigation and Click-stream Data | text | Click-stream collected on the client |
clickstream.server | Navigation and Click-stream Data | text | Click-stream collected on the server |
cookies | (variable-category) | text | Cookies are processed (read/write) |
http.useragent | Computer Information | text | User Agent information |
http.referrer | Navigation and Click-stream Data | uri | Last URI requested by the user |
miscdata | (variable-category) | text | Miscellaneous non-base data schema information |
searchtext | Interactive Data | text | Search terms |
interactionrecord | Interactive Data | text | Server stores the transaction history |
These elements are often implicit in navigation or Web interactions. They should be used with categories to describe the type of information collected through these methods. A brief description of each element follows.
"clickstream.client" should be used when the server accesses off-line browsing information that has been collected by the user's client. Some versions (e.g. 5.0) of Microsoft's Internet Explorer are known to support such behavior.
"clickstream.server" will probably apply to almost all sites on the Web today. It must be used whenever page access data is kept on the server side. Almost all known Web server implementations today will by default create such an access log, often including origin of the request (IP address or DNS name), time, requested resource, HTTP return code and transferred bytes. Any combination of resource name and originating address should be considered clickstream data (i.e. it allows the reconstruction of a visitors movements through the site) and should be declared.
The logging of referer or user agent information (included in the headers of the HTTP request by many browsers) should explicitly by declared using the http.useragent and http.referrer data elements.
"cookies" should be used whenever information is placed on a user's machine using the HTTP cookie mechanism in order to be "solicited" (i.e. automatically sent) later. Please note that "cookies" is a variable data element and requires the explicit declaration of usage categories in a policy.
"http.useragent" indicates that the server stores additional information about the user agent in its logs, such as operating system, browser software and version.
"http.referrer" indicates that the server stores additional information about the page the user viewed previously, as indicated by the HTTP_REFERER header.
The "miscdata" element references information collected by the service that the service does not reference using a specific data element. Sites MUST reference a separate miscdata element in their policies for each category of miscdata they collect.
"searchtext" is a specific type of solicitation used for searching and indexing sites. For example, if the only fields on a search engine page are search fields, the site only needs to disclose that data element.
The "interactionrecord" element should be used if the server is keeping track of the interaction it has with the user (i.e. information other than clickstream data, for example account transactions, etc). This element is only meant to inform the user that such information will be retained, but does not indicate how long such data will be kept.
Policies that contain one or more of the Variable Data Elements above explicitly declare the category of the information they solicit, for example:
<POLICY ... >
...
<DATA ref="dynamic.miscdata"><CATEGORIES><online/></CATEGORIES></DATA>
...
</POLICY>
when asking for a user's IRC name (which would be in category Online Contact Information).
P3P offers Web sites a great deal of flexibility in how they describe the types of data they collect.
And these three methods may be combined within a single policy.
By using the dynamic.miscdata element, sites can specify the types of data they collect without having to enumerate every individual data element. This may be convenient for sites that collect a lot of data or sites belonging to large organizations that want to offer a single P3P policy covering the entire organization. However, the disadvantage of this approach is that user agents will have to assume that the site might collect any data element belonging to the categories referenced by the site. So, for example, if a site's policy states that it collects dynamic.miscdata of the physical contact information category, but the only physical contact information it collects is business address, user agents will none-the-less assume that the site might also collect phone numbers. If the site wishes to be clear that it does not collect phone numbers or any other physical contact information other than business address, than it should disclose that it collects user.business.postal.. Furthermore, as user agents are developed with automatic form-filling capabilities, it is likely that sites that enumerate the data they collect will be able to better integrate with these tools.
By defining new data schemas, sites can precisely specify the data they collect beyond the base data set. However, if user agents are unfamiliar with the elements defined in these schemas, they will be able to provide only minimal information to the user about these new elements. The information they provide will be based on the category and display names specified for each element.
Regardless of whether a site wishes to make general or specific data disclosures, there are additional advantages to disclosing specific elements from the dynamic. data set. For example, by disclosing dynamic.cookies a site can indicate that it uses cookies and explain the purpose of this use. The working group encourages user agent implementations that offer users cookie control interfaces based on this information. Likewise, user agents that by default do not send the HTTP_REFERER header, might look for the http.referrer element in P3P policies and send the header if it will be used for a purpose the user finds acceptable.
The data schema corresponding to the P3P base data schema follows. In order to improve legibility, the code is indented and aligned along various attribute names. However, note that the whitespace in the actual schema is significant because the content of the document must remain unchanged (immutability of dataschemas). The actual normative schema is present at the URI http://www.w3.org/TR/P3P/base .
<DATASCHEMA xmlns="http://www.w3.org/2000/P3Pv1">
<?xml version='1.0'?>
<DATA-GROUP>
<!-- ********** Base Data Types **********
-->
<!-- "date." Data Type -->
<DATA name="date.ymd.year"
short-description="Year"
typeref="#number" size="6"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.ymd.month"
short-description="Month"
typeref="#number" size="2"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.ymd.day"
short-description="Day"
typeref="#number" size="2"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.hms.hour"
short-description="Hour"
typeref="#number" size="2"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.hms.minute"
short-description="Minutes"
typeref="#number" size="2"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.hms.second"
short-description="Second"
typeref="#number" size="2"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.fractionsecond"
short-description="Fraction of Second"
typeref="#number" size="6"
template="yes"/> <!-- Variable Data Element
-->
<DATA name="date.timezone"
short-description="Time Zone"
typeref="#text" size="10"
template="yes"/> <!-- Variable Data Element
-->
<!-- "personname." Data Type
-->
<DATA name="personname.prefix"
short-description="Name Prefix"
typeref="#text"
template="yes">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="personname.given"
short-description="Given Name"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="personname.family"
short-description="Family Name"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="personname.middle"
short-description="Middle Name"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="personname.suffix"
short-description="Name Suffix"
typeref="#text"
template="yes">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="personname.formatted"
short-description="Formatted Name"
typeref="#text"
template="yes">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="personname.nickname"
short-description="Nickname"
typeref="#text"
template="yes">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<!-- "certificate." Data Type
-->
<DATA name="certificate.key"
short-description="Certificate Key"
typeref="#binary" size="0"
template="yes">
<CATEGORIES><uniqueid/></CATEGORIES>
</DATA>
<DATA name="certificate.format"
short-description="Certificate format"
typeref="#number" size="128"
template="yes">
<CATEGORIES><uniqueid/></CATEGORIES>
</DATA>
<!-- "phonenum." Data Type -->
<DATA name="phonenum.intcode"
short-description="International Phone Code"
typeref="#number" size="11"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="phonenum.loccode"
short-description="Local Phone Area Code"
typeref="#number" size="11"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="phonenum.number"
short-description="Phone Number"
typeref="#number" size="30"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="phonenum.ext"
short-description="Phone Extension"
typeref="#number" size="11"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="phonenum.comment"
short-description="Phone Optional Comments"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<!-- "contact." Data Type" -->
<DATA name="contact.postal."
short-description="Postal Address
Information"
typeref="#postal."
template="yes">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="contact.telecom."
short-description="Telecommunications
Information"
typeref="#telecom."
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="contact.online."
short-description="Online Address
Information"
typeref="#online."
template="yes">
<CATEGORIES><online/></CATEGORIES>
</DATA>
<!-- "postal." Data Type -->
<DATA name="postal.name."
short-description="Name"
typeref="#personname."
template="yes">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="postal.street.line1"
short-description="Street Address, Line 1"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="postal.street.line2"
short-description="Street Address, Line 2"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="postal.street.line3"
short-description="Street Address, Line 3"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="postal.city"
short-description="City"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="postal.stateprov"
short-description="State or Province"
typeref="#text"
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="postal.postalcode"
short-description="Postal Code"
typeref="#text"
template="yes">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="postal.organization"
short-description="Organization Name"
typeref="#text"
template="yes">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="postal.formatted"
short-description="Formatted Postal Address"
typeref="#text"
template="yes">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="postal.country"
short-description="Country Name"
typeref="#text"
template="yes">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="postal.countrycode"
short-description="Country Code"
typeref="#countrycode" size="2"
template="yes">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<!-- "telecom." Data Type -->
<DATA name="telecom.phone."
short-description="Phone Number"
typeref="#phonenum."
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="telecom.fax."
short-description="Fax Number"
typeref="#phonenum."
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="telecom.mobile."
short-description="Mobile Phone Number"
typeref="#phonenum."
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="telecom.pager."
short-description="Pager Number"
typeref="#phonenum."
template="yes">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<!-- "online." Data Type -->
<DATA name="online.email"
short-description="Email Address"
typeref="#text"
template="yes">
<CATEGORIES><online/></CATEGORIES>
</DATA>
<DATA name="online.uri"
short-description="Home Page Address"
typeref="#uri"
template="yes">
<CATEGORIES><online/></CATEGORIES>
</DATA>
<!-- ********** Base Data Schemas ********** -->
<!-- "dynamic." Data Schema
-->
<DATA
ref="#dynamic.clickstream.client"
short-description="Click-stream collected on the
client"
typeref="#text" source="service">
<CATEGORIES><navigation/></CATEGORIES>
</DATA>
<DATA
ref="#dynamic.clickstream.server"
short-description="Click-stream collected on the
server"
typeref="#text" source="service" >
<CATEGORIES><navigation/></CATEGORIES>
</DATA>
<DATA name="dynamic.cookies"
short-description="cookies are processed
(read/write)"
typeref="#text" source="service"/>
<!-- Variable Data Element
-->
<DATA
name="dynamic.http.useragent"
short-description="User Agent information"
typeref="#text"
source="service">
<CATEGORIES><navigation/></CATEGORIES>
</DATA>
<DATA name="dynamic.http.referrer"
short-description="Last URI requested by the
user"
typeref="#uri" source="service">
<CATEGORIES><navigation/></CATEGORIES>
</DATA>
<DATA name="dynamic.miscdata"
short-description="Miscellaneous non base data schema
information"
typeref="#text" source="service"/>
<!-- Variable Data Element
-->
<DATA name="dynamic.searchtext"
short-description="Search terms"
typeref="#text" source="service">
<CATEGORIES><interactive/></CATEGORIES>
</DATA>
<DATA
name="dynamic.interactionrecord"
short-description="server stores the transaction
history"
typeref="#text" source="service">
<CATEGORIES><interactive/></CATEGORIES>
</DATA>
<!-- "user." Data Schema -->
<DATA name="user.name."
short-description="User's Name"
typeref="#personname.">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="user.bdate."
short-description="User's Birth Date"
typeref="#date.">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="user.cert."
short-description="User's Identity
certificate"
typeref="#certificate.">
<CATEGORIES><uniqueid/></CATEGORIES>
</DATA>
<DATA name="user.gender"
short-description="User's gender"
typeref="#gender">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="user.jobtitle"
short-description="User's Job Title"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="user.home."
short-description="User's Home Contact
Information"
typeref="#contact.">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="user.business."
short-description="User's Business Contact
Information"
typeref="#contact.">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="user.employer"
short-description="Name of User's Employer"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="user.department"
short-description="Department or division of organization
where user is employed"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<!-- "thirdparty." Data Schema
-->
<DATA name="thirdparty.name."
short-description="Third Party's Name"
typeref="#personname.">
<CATEGORIES><physical/><demographic/></CATEGORIES>
</DATA>
<DATA name="thirdparty.bdate."
short-description="Third Party's Birth Date"
typeref="#date.">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="thirdparty.cert."
short-description="Third Party's Identity
certificate"
typeref="#certificate.">
<CATEGORIES><uniqueid/></CATEGORIES>
</DATA>
<DATA name="thirdparty.gender"
short-description="Third Party's gender"
typeref="#gender">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="thirdparty.jobtitle"
short-description="Third Party's Job Title"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="thirdparty.home."
short-description="Third Party's Home Contact
Information"
typeref="#contact.">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="thirdparty.business."
short-description="Third Party's Business Contact
Information"
typeref="#contact.">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
<DATA name="thirdparty.employer"
short-description="Name of Third Party's
Employer"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="thirdparty.department"
short-description="Department or division of organization
where third party is employed"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<!-- "business." Data Schema
-->
<DATA name="business.name"
short-description="Organization Name"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="business.department"
short-description="Department or division of
organization"
typeref="#text">
<CATEGORIES><demographic/></CATEGORIES>
</DATA>
<DATA name="business.cert."
short-description="Organization Identity
certificate"
typeref="#certificate.">
<CATEGORIES><uniqueid/></CATEGORIES>
</DATA>
<DATA name="business.contact-info."
short-description="Contact Information for the
Organization"
typeref="#contact.">
<CATEGORIES><physical/></CATEGORIES>
</DATA>
</DATA-GROUP>
</DATASCHEMA>
This appendix contains three XML Schemas, one for P3P policy reference files, one for P3P policy documents, and one for P3P dataschema documents. Note that, due to many recent changes in the syntax, these schemas may not be completely up-to-date. An XML Schema may be used to validate the structure and datatype values used in an instance of the schema given as an XML document. P3P policy and dataschema documents are XML documents that MUST conform to these schemas. Note that these schemas are based on the XML Schema working drafts [XML-Schema1][XML-Schema2], which are subject to change.
<?xml version='1.0'?>
<!-- XML Schema for policy reference files --> <schema xmlns='http://www.w3.org/1999/XMLSchema' targetNamespace='http://www.w3.org/2000/P3Pv1' xmlns:p3pr='http://www.w3.org/2000/P3Pv1' xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#' > <element name='POLICY-REFERENCES'> <complexType content='elementOnly'> <element ref='rdf:RDF'/> <attribute name='xmlns:rdf' type='uriReference' use='fixed' value='http://www.w3.org/1999/02/22-rdf-syntax-ns'/> </complexType> </element> <element name='rdf:RDF'> <complexType content='elementOnly'> <element ref='p3pr:POLICY-REF' maxOccurs='unbounded'/> </complexType> </element> <element name='POLICY-REF'> <complexType content='elementOnly'> <choice maxOccurs='unbounded'> <element ref='p3pr:PREFIX'/> <element ref='p3pr:EXCLUDE'/> <element ref='p3pr:MEHTOD'/> </choice> <attribute name='rdf:about' type='uriReference' use='required'/> </complexType> </element> <element name='PREFIX'> <complexType content='mixed'/> </element> <element name='EXCLUDE'> <complexType content='mixed'/> </element> <element name='METHOD'> <complexType content='mixed'/> </element> </schema> <?xml version='1.0'?> <!-- XML Schema for Policy documents --> <schema xmlns='http://www.w3.org/1999/XMLSchema' targetNamespace='http://www.w3.org/2000/P3Pv1' xmlns:p3p='http://www.w3.org/2000/P3Pv1'> <element name='POLICY'> <complexType content='elementOnly'> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <element ref='p3p:ENTITY'/> <element ref='p3p:DISPUTES-GROUP' minOccurs='0' maxOccurs='1'/> <element ref='p3p:ACCESS'/> <element ref='p3p:STATEMENT' maxOccurs='unbounded'/> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <attribute name='discuri' type='uriReference' use='required'/> </complexType> </element> <element name='ENTITY'> <complexType content='elementOnly'> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <element ref='p3p:DATA-GROUP'/> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='DISPUTES-GROUP'> <complexType content='elementOnly'> <element ref='p3p:DISPUTES' maxOccurs='unbounded'/> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='DISPUTES'> <complexType content='elementOnly'> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <element ref='p3p:REMEDIES' minOccurs='0' maxOccurs='1'/> <element ref='p3p:IMAGE' minOccurs='0' maxOccurs='1'/> <element ref='p3p:SHORT-DESCRIPTION' minOccurs='0' maxOccurs='1'/> <element ref='p3p:LONG-DESCRIPTION' minOccurs='0' maxOccurs='1'/> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <attribute name='resolution-type' use='required'> <simpleType base='string'> <enumeration value='service'/> <enumeration value='independent'/> <enumeration value='court'/> <enumeration value='law'/> </simpleType> </attribute> <attribute name='service' type='uriReference' use='required'/> <attribute name='verification' type='string' use='optional'/> <attribute name='short-description' type='string' use='optional'/> </complexType> </element> <element name='REMEDIES'> <complexType content='elementOnly'> <choice maxOccurs='unbounded'> <element ref='p3p:correct'/> <element ref='p3p:money'/> <element ref='p3p:law'/> </choice> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='correct'> <complexType content='empty'/> </element> <element name='money'> <complexType content='empty'/> </element> <element name='law'> <complexType content='empty'/> </element> <element name='IMAGE'> <complexType content='empty'> <attribute name='src' type='uriReference' use='required'/> <attribute name='alt' type='string' use='optional'/> <attribute name='width' type='nonNegativeInteger' use='optional'/> <attribute name='height' type='nonNegativeInteger' use='optional'/> </complexType> </element> <element name='LONG-DESCRIPTION'> <complexType content='mixed'> </complexType> </element> <element name='ACCESS'> <complexType content='elementOnly'> <choice> <element ref='p3p:nonident'/> <element ref='p3p:contact'/> <element ref='p3p:other_ident'/> <element ref='p3p:contact_and_other'/> <element ref='p3p:all'/> <element ref='p3p:none'/> </choice> </complexType> </element> <element name='nonident'> <complexType content='empty'/> </element> <element name='other_ident'> <complexType content='empty'/> </element> <element name='contact_and_other'> <complexType content='empty'/> </element> <element name='all'> <complexType content='empty'/> </element> <element name='none'> <complexType content='empty'/> </element> <element name='STATEMENT'> <complexType content='elementOnly'> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <element ref='p3p:CONSEQUENCE' minOccurs='0' maxOccurs='1'/> <element ref='p3p:PURPOSE'/> <element ref='p3p:RECIPIENT'/> <element ref='p3p:RETENTION'/> <element ref='p3p:DATA-GROUP' maxOccurs='unbounded'/> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='CONSEQUENCE'> <complexType content='mixed'> </complexType> </element> <element name='PURPOSE'> <complexType content='elementOnly'> <choice maxOccurs='unbounded'> <element ref='p3p:current'/> <element ref='p3p:admin'/> <element ref='p3p:develop'/> <element ref='p3p:contact'/> <element ref='p3p:customization'/> <element ref='p3p:targeting'/> <element ref='p3p:profiling'/> <element ref='p3p:other-purpose'/> </choice> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='current'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='admin'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='develop'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='contact'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='customization'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='targeting'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='profiling'> <complexType content='empty'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='other-purpose'> <complexType content='mixed'> <attribute name='change_preferences' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> <element name='RECIPIENT'> <complexType content='elementOnly'> <choice maxOccurs='unbounded'> <element ref='p3p:ours'/> <element ref='p3p:same'/> <element ref='p3p:other-recipient'/> <element ref='p3p:delivery'/> <element ref='p3p:public'/> <element ref='p3p:unrelated'/> </choice> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='ours'> <complexType content='empty'/> </element> <element name='same'> <complexType content='empty'/> </element> <element name='other-recipient'> <complexType content='empty'/> </element> <element name='delivery'> <complexType content='empty'/> </element> <element name='public'> <complexType content='empty'/> </element> <element name='unrelated'> <complexType content='empty'/> </element> <element name='RETENTION'> <complexType content='elementOnly'> <choice> <element ref='p3p:no-retention'/> <element ref='p3p:stated-purpose'/> <element ref='p3p:legal-requirement'/> <element ref='p3p:indefinitely'/> <element ref='p3p:business-practices'/> </choice> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> </complexType> </element> <element name='no-retention'> <complexType content='empty'/> </element> <element name='stated-purpose'> <complexType content='empty'/> </element> <element name='legal-requirement'> <complexType content='empty'/> </element> <element name='indefinitely'> <complexType content='empty'/> </element> <element name='business-practices'> <complexType content='empty'/> </element> <element name='DATA-GROUP'> <complexType content='elementOnly'> <element ref='p3p:DATA' maxOccurs='unbounded'/> <element ref='p3p:EXTENSION' minOccurs='0' maxOccurs='unbounded'/> <attribute name='base' type='uriReference' use='default' value='http://www.w3.org/TR/P3P/base'/> </complexType> </element> <element name='DATA'> <complexType content='mixed'> <choice minOccurs='0' maxOccurs='unbounded'> <element ref='p3p:CATEGORIES'/> <element ref='p3p:EXTENSION'/> </choice> <attribute name='name' type='uriReference' use='required'/> <attribute name='optional' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> <attribute name='typeref' type='uriReference' use='required'/> <attribute name='template' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> <attribute name='short' type='string' use='optional'/> <attribute name='long' type='string' use='optional'/> <attribute name='size' type='nonNegativeInteger' use='optional'/> </complexType> </element> <element name='CATEGORIES'> <complexType content='elementOnly'> <choice maxOccurs='unbounded'> <element ref='p3p:physical'/> <element ref='p3p:online'/> <element ref='p3p:uniqueid'/> <element ref='p3p:purchase'/> <element ref='p3p:financial'/> <element ref='p3p:computer'/> <element ref='p3p:navigation'/> <element ref='p3p:interactive'/> <element ref='p3p:demographic'/> <element ref='p3p:content'/> <element ref='p3p:state'/> <element ref='p3p:political'/> <element ref='p3p:health'/> <element ref='p3p:preference'/> <element ref='p3p:other'/> </choice> </complexType> </element> <element name='physical'> <complexType content='empty'/> </element> <element name='online'> <complexType content='empty'/> </element> <element name='uniqueid'> <complexType content='empty'/> </element> <element name='purchase'> <complexType content='empty'/> </element> <element name='financial'> <complexType content='empty'/> </element> <element name='computer'> <complexType content='empty'/> </element> <element name='navigation'> <complexType content='empty'/> </element> <element name='interactive'> <complexType content='empty'/> </element> <element name='demographic'> <complexType content='empty'/> </element> <element name='content'> <complexType content='empty'/> </element> <element name='state'> <complexType content='empty'/> </element> <element name='political'> <complexType content='empty'/> </element> <element name='health'> <complexType content='empty'/> </element> <element name='preference'> <complexType content='empty'/> </element> <element name='other'> <complexType content='empty'/> </element> <element name='EXTENSION'> <complexType content='mixed'> <attribute name='optional' use='default' value='yes'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> </schema> <?xml version='1.0'?> <!-- XML Schema for Dataschemas --> <schema xmlns='http://www.w3.org/1999/XMLSchema' targetNamespace='http://www.w3.org/2000/P3Pv1' xmlns:p3ps='http://www.w3.org/2000/P3Pv1'> <element name='DATASCHEMA'> <complexType content='elementOnly'> <choice minOccurs='0' maxOccurs='unbounded'> <element ref='p3ps:DATA'/> <element ref='p3ps:EXTENSION'/> </choice> </complexType> </element> <element name='DATA'> <complexType content='mixed'> <choice minOccurs='0' maxOccurs='unbounded'> <element ref='p3ps:CATEGORIES'/> <element ref='p3ps:EXTENSION'/> </choice> <attribute name='name' type='uriReference' use='required'/> <attribute name='optional' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> <attribute name='type' type='uriReference' use='optional'/> <attribute name='template' use='default' value='no'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> <attribute name='short' type='string' use='optional'/> <attribute name='long' type='string' use='optional'/> <attribute name='size' type='nonNegativeInteger' use='default' value='0'/> </complexType> </element> <element name='CATEGORIES'> <complexType content='elementOnly'> <choice maxOccurs='unbounded'> <element ref='p3ps:physical'/> <element ref='p3ps:online'/> <element ref='p3ps:uniqueid'/> <element ref='p3ps:purchase'/> <element ref='p3ps:financial'/> <element ref='p3ps:computer'/> <element ref='p3ps:navigation'/> <element ref='p3ps:interactive'/> <element ref='p3ps:demographic'/> <element ref='p3ps:content'/> <element ref='p3ps:state'/> <element ref='p3ps:political'/> <element ref='p3ps:health'/> <element ref='p3ps:preference'/> <element ref='p3ps:other'/> </choice> </complexType> </element> <element name='physical'> <complexType content='empty'/> </element> <element name='online'> <complexType content='empty'/> </element> <element name='uniqueid'> <complexType content='empty'/> </element> <element name='purchase'> <complexType content='empty'/> </element> <element name='financial'> <complexType content='empty'/> </element> <element name='computer'> <complexType content='empty'/> </element> <element name='navigation'> <complexType content='empty'/> </element> <element name='interactive'> <complexType content='empty'/> </element> <element name='demographic'> <complexType content='empty'/> </element> <element name='content'> <complexType content='empty'/> </element> <element name='state'> <complexType content='empty'/> </element> <element name='political'> <complexType content='empty'/> </element> <element name='health'> <complexType content='empty'/> </element> <element name='preference'> <complexType content='empty'/> </element> <element name='other'> <complexType content='empty'/> </element> <element name='EXTENSION'> <complexType content='mixed'> <attribute name='optional' use='default' value='yes'> <simpleType base='string'> <enumeration value='yes'/> <enumeration value='no'/> </simpleType> </attribute> </complexType> </element> </schema>
This appendix contains the DTDs for policy documents and for dataschemas. Note that, due to many recent changes in the syntax, these DTDs may not be completely up-to-date. The following is the XML DTD for P3P policy documents.
<!-- ************** Entities ************** --> <!ENTITY % URI "CDATA"> <!ENTITY % NUMBER "CDATA"> <!-- *********** POLICY *********** --> <!ELEMENT POLICY (EXTENSION*, ENTITY, DISPUTES-GROUP?, ACCESS, STATEMENT+, EXTENSION*)> <!ATTLIST POLICY discuri %URI; #REQUIRED > <!-- *********** ENTITY *********** --> <!ELEMENT ENTITY (EXTENSION*, DATA-GROUP, EXTENSION*)> <!-- *********** DISPUTES *********** --> <!ELEMENT DISPUTES-GROUP (DISPUTES+, EXTENSION*)> <!ELEMENT DISPUTES (EXTENSION*, REMEDIES?, IMAGE?, LONG-DESCRIPTION?, EXTENSION*)> <!ATTLIST DISPUTES resolution-type (service | independent | court | law) #REQUIRED service %URI; #REQUIRED verification CDATA #IMPLIED short-description CDATA #IMPLIED > <!-- *********** REMEDIES *********** --> <!ELEMENT REMEDIES ((correct | money | law)+, EXTENSION*)> <!ELEMENT correct EMPTY> <!ELEMENT money EMPTY> <!ELEMENT law EMPTY> <!-- *********** IMAGE *********** --> <!ELEMENT IMAGE EMPTY> <!ATTLIST IMAGE src %URI; #REQUIRED alt CDATA #IMPLIED width %NUMBER; #IMPLIED height %NUMBER; #IMPLIED > <!-- *********** DESCRIPTION *********** --> <!ELEMENT LONG-DESCRIPTION (#PCDATA)> <!-- *********** ACCESS *********** --> <!ELEMENT ACCESS (nonident | contact | other_ident | contact_and_other | all | none)> <!ELEMENT nonident EMPTY> <!ELEMENT other_ident EMPTY> <!ELEMENT contact_and_other EMPTY> <!ELEMENT all EMPTY> <!ELEMENT none EMPTY> <!-- *********** STATEMENT *********** --> <!ELEMENT STATEMENT (EXTENSION*, CONSEQUENCE?, PURPOSE, RECIPIENT, RETENTION, DATA-GROUP+, EXTENSION*)> <!-- *********** CONSEQUENCE *********** --> <!ELEMENT CONSEQUENCE (#PCDATA)> <!-- *********** PURPOSE *********** --> <!ELEMENT PURPOSE ((current | admin | develop | contact | customization | targeting | profiling | other-purpose)+, EXTENSION*)> <!ELEMENT current EMPTY> <!ATTLIST current change_preferences (yes | no) "no" > <!ELEMENT admin EMPTY> <!ATTLIST admin change_preferences (yes | no) "no" > <!ELEMENT develop EMPTY> <!ATTLIST develop change_preferences (yes | no) "no" > <!ELEMENT contact EMPTY> <!ATTLIST contact change_preferences (yes | no) "no" > <!ELEMENT customization EMPTY> <!ATTLIST customization change_preferences (yes | no) "no" > <!ELEMENT targeting EMPTY> <!ATTLIST targeting change_preferences (yes | no) "no" > <!ELEMENT profiling EMPTY> <!ATTLIST profiling change_preferences (yes | no) "no" > <!ELEMENT other-purpose (#PCDATA)> <!ATTLIST other-purpose change_preferences (yes | no) "no" > <!-- *********** RECIPIENT *********** --> <!ELEMENT RECIPIENT ((ours | same | other-recipient | delivery | public | unrelated)+, EXTENSION*)> <!ELEMENT ours EMPTY> <!ELEMENT same EMPTY> <!ELEMENT other-recipient EMPTY> <!ELEMENT delivery EMPTY> <!ELEMENT public EMPTY> <!ELEMENT unrelated EMPTY> <!-- *********** RETENTION *********** --> <!ELEMENT RETENTION ((no-retention | stated-purpose | legal-requirement | indefinitely | business-practices), EXTENSION*)> <!ELEMENT no-retention EMPTY> <!ELEMENT stated-purpose EMPTY> <!ELEMENT legal-requirement EMPTY> <!ELEMENT indefinitely EMPTY> <!ELEMENT business-practices EMPTY> <!-- *********** DATA *********** --> <!ELEMENT DATA-GROUP (DATA+, EXTENSION*)> <!ATTLIST DATA-GROUP base %URI; "http://www.w3.org/TR/P3P/base" > <!ELEMENT DATA (#PCDATA | CATEGORIES | EXTENSION)*> <!ATTLIST DATA ref %URI; #REQUIRED optional (yes | no) "no" typeref %URI; #IMPLIED template (yes | no) "no" short CDATA #IMPLIED long CDATA #IMPLIED size %NUMBER; #IMPLIED > <!-- *********** CATEGORIES *********** --> <!ELEMENT CATEGORIES (physical | online | uniqueid | purchase | financial | computer | navigation | interactive | demographic | content | state | political | health | preference | other)+> <!ELEMENT physical EMPTY> <!ELEMENT online EMPTY> <!ELEMENT uniqueid EMPTY> <!ELEMENT purchase EMPTY> <!ELEMENT financial EMPTY> <!ELEMENT computer EMPTY> <!ELEMENT navigation EMPTY> <!ELEMENT interactive EMPTY> <!ELEMENT demographic EMPTY> <!ELEMENT content EMPTY> <!ELEMENT state EMPTY> <!ELEMENT political EMPTY> <!ELEMENT health EMPTY> <!ELEMENT preference EMPTY> <!ELEMENT other EMPTY> <!-- *********** EXTENSION *********** --> <!ELEMENT EXTENSION (#PCDATA)> <!ATTLIST EXTENSION optional (yes | no) "yes" >
The following is the DTD for policy reference files:
<!-- ************** Entities ************** --> <!ENTITY % URI "CDATA"> <!ENTITY % RDFNS "web"> <!ENTITY % RDF "%RDFNS;:RDF"> <!-- ****** POLICY-REFERENCES ****** --> <!ELEMENT POLICY-REFERENCES (%RDFNS;:RDF)> <!ATTLIST POLICY-REFERENCES xmlns:%RDFNS; %URI; #FIXED "http://www.w3.org/1999/02/22-rdf-syntax-ns" > <!-- ****** RDF ****** --> <!ELEMENT %RDF; (POLICY-REF+)> <!-- ****** POLICY-REF ****** --> <!ELEMENT POLICY-REF (PREFIX | EXCLUDE | MEHTOD)+> <!ATTLIST POLICY-REF %RDFNS;:about %URI; #REQUIRED > <!-- ****** PREFIX ****** --> <!ELEMENT PREFIX (#PCDATA)> <!-- ****** EXCLUDE ****** --> <!ELEMENT EXCLUDE (#PCDATA)> <!-- ****** METHOD ****** --> <!ELEMENT METHOD (#PCDATA)>
The following is the DTD for dataschemas:
<!-- ************** Entities ************** --> <!ENTITY % URI "CDATA"> <!ENTITY % NUMBER "CDATA"> <!-- *********** DATASCHEMA *********** --> <!ELEMENT DATASCHEMA (DATA | EXTENSION)*> <!-- *********** DATA *********** --> <!ELEMENT DATA (#PCDATA | CATEGORIES | EXTENSION)*> <!ATTLIST DATA name %URI; #REQUIRED optional (yes | no) "no" typeref %URI; #REQUIRED template (yes | no) "no" short CDATA #IMPLIED long CDATA #IMPLIED size %NUMBER; "0" > <!-- *********** CATEGORIES *********** --> <!ELEMENT CATEGORIES (physical | online | uniqueid | purchase | financial | computer | navigation | interactive | demographic | content | state | political | health | preference | other)+> <!ELEMENT physical EMPTY> <!ELEMENT online EMPTY> <!ELEMENT uniqueid EMPTY> <!ELEMENT purchase EMPTY> <!ELEMENT financial EMPTY> <!ELEMENT computer EMPTY> <!ELEMENT navigation EMPTY> <!ELEMENT interactive EMPTY> <!ELEMENT demographic EMPTY> <!ELEMENT content EMPTY> <!ELEMENT state EMPTY> <!ELEMENT political EMPTY> <!ELEMENT health EMPTY> <!ELEMENT preference EMPTY> <!ELEMENT other EMPTY> <!-- *********** EXTENSION *********** --> <!ELEMENT EXTENSION (#PCDATA)> <!ATTLIST EXTENSION optional (yes | no) "yes" >
This diagram illustrates the RDF [RDF] data model for the policy shown in Example 3.1. Note this representation can be not completely up-to-date with respect to the current syntax, the working group is currently working on synchronizing this RDF representation with all the most recent changes.
The formal grammar of P3P is given in this specification using a slight modification of [ABNF]. The following is a simple description of the ABNF.
name = (elements)
(
element1 element2)
<a>*<b>element
<a>element
<a>*element
*<b>element
*element
[element]
"string"
or 'string'
Other notations used in the productions are:
This appendix describes the intent of P3P development and recommends guidelines regarding the responsible use of P3P technology. An earlier version was published in the W3C Note "P3P Guiding Principles".
The Platform for Privacy Preferences Project (P3P) has been designed to be flexible and support a diverse set of user preferences, public policies, service provider polices, and applications. This flexibility will provide opportunities for using P3P in a wide variety of innovative ways that its designers had not imagined. The P3P Guiding Principles were created in order to: express the intentions of the undersigned members of the P3P working groups when designing this technology and suggest how P3P can be used most effectively in order to maximize privacy and user confidence and trust on the Web. In keeping with our goal of flexibility, this document does not place requirements upon any party. Rather, it makes recommendations about 1) what should be done to be consistent with the intentions of the P3P designers and 2) how to maximize user confidence in P3P implementations and Web services. Organizations, individuals, policy-makers, and companies who use P3P are invited to join us in supporting these principles.
P3P has been designed to promote privacy and trust on the Web by enabling service providers to disclose their information practices, and enabling individuals to make informed decisions about the collection and use of their personal information. P3P user agents work on behalf of individuals to reach agreements with service providers about the collection and use of personal information. Trust is built upon the mutual understanding that each party will respect the agreement reached.
Service providers should preserve trust and protect privacy by applying relevant laws and principles of data protection and privacy to their information practices. The following is a list of privacy principles and guidelines that helped inform the development of P3P and may be useful to those who use P3P:
In addition, service providers and P3P implementers should recognize and address the special concerns surrounding children's privacy.
Service providers should provide timely and effective notices of their information practices, and user agents should provide effective tools for users to access these notices and make decisions based on them.
Service providers should:
User agents should:
Users should be given the ability to make meaningful choices about the collection, use, and disclosure of personal information. Users should retain control over their personal information and decide the conditions under which they will share it.
Service providers should:
User agents should:
Service providers should treat users and their personal information with fairness and integrity. This is essential for protecting privacy and promoting trust.
Service providers should:
User agents should:
While P3P itself does not include security mechanisms, it is intended to be used in conjunction with security tools. Users' personal information should always be protected with reasonable security safeguards in keeping with the sensitivity of the information.
Service providers should:
User agents should:
This specification was produced by the P3P Specification Working Group. The following individuals participated in the P3P Specification Working Group, chaired by Lorrie Cranor (AT&T): Mark Ackerman (University of California, Irvine), Margareta Björksten (Nokia), Joe Coco (Microsoft), Patrick Feng (RPI), Yuichi Koike (NEC/W3C), Daniel LaLiberte (Crystaliz), Marc Langheinrich (NEC/ETH Zurich), Daniel Lim (PrivacyBank), Massimo Marchiori (W3C/MIT), Christine McKenna (Phone.com, Inc.), Paul Perry (Microsoft), Martin Presler-Marshall (IBM), Joel Reidenberg (Fordham Law School), Dave Remy (Geotrust), Ari Schwartz (CDT), Rigo Wenning (W3C), Betty Whitaker (NCR), Sam Yen (Citigroup), Alan Zausner (American Express).
The P3P Specification Working Group inherited a large part of the specification from previous P3P Working Groups. The Working Group would like to acknowledge the contributions of the members of these previous groups (affiliations shown are the members' affiliations at the time of their participation in each Working Group).
The P3P Implementation and Deployment Working Group, chaired by Rolf Nelson (W3C) and Marc Langheinrich (NEC/ETH Zurich): Mark Ackerman (University of California, Irvine), Rob Barrett (IBM), Joe Coco (Microsoft), Lorrie Cranor (AT&T), Massimo Marchiori (W3C/MIT), Gabe Montero (IBM), Stephen Morse (Netscape), Paul Perry (Microsoft), Ari Schwartz (CDT), Gabriel Speyer (Citibank), Betty Whitaker (NCR).
The P3P Syntax Working Group, chaired by Steve Lucas (Matchlogic): Lorrie Cranor (AT&T), Melissa Dunn (Microsoft), Daniel Jaye (Engage Technologies), Massimo Marchiori (W3C/MIT), Maclen Marvit (Narrowline), Max Metral (Firefly), Paul Perry (Firefly), Martin Presler-Marshall (IBM), Drummond Reed (Intermind), Joseph Reagle (W3C).
The P3P Vocabulary Harmonization Working Group, chaired by Joseph Reagle (W3C): Liz Blumenfeld (America Online), Ann Cavoukian (Information and Privacy Commission/Ontario), Scott Chalfant (Matchlogic), Lorrie Cranor (AT&T), Jim Crowe (Direct Marketing Association), Josef Dietl (W3C), David Duncan (Information and Privacy Commission/Ontario), Melissa Dunn (Microsoft), Patricica Faley (Direct Marketing Association), Marit Köhntopp (Privacy Commissioner of Schleswig-Holstein, Germany), Tony Lam (Hong Kong Privacy Commissioner's Office), Tara Lemmey (Narrowline), Jill Lesser (America Online), Steve Lucas (Matchlogic), Deirdre Mulligan (Center for Democracy and Technology), Nick Platten (Data Protection Consultant, formerly of DG XV, European Commission), Ari Schwartz (Center for Democracy and Technology), Jonathan Stark (TRUSTe).
The P3P Protocols and Data Transport Working Group, chaired by Yves Leroux (Digital): Lorrie Cranor (AT&T), Philip DesAutels (Matchlogic), Melissa Dunn (Microsoft), Peter Heymann (Intermind), Tatsuo Itabashi (Sony), Dan Jaye (Engage), Steve Lucas (Matchlogic), Jim Miller (W3C), Michael Myers (VeriSign), Paul Perry (FireFly), Martin Presler-Marshall (IBM), Joseph Reagle (W3C), Drummond Reed (Intermind), Craig Vodnik (Pencom Web Worlds).
The P3P Vocabulary Working Group, chaired by Lorrie Cranor (AT&T): Mark Ackerman (W3C), Philip DesAutels (W3C), Melissa Dunn (Microsoft), Joseph Reagle (W3C), Upendra Shardanand (Firefly).
The P3P Architecture Working Group, chaired by Martin Presler-Marshall (IBM): Mark Ackerman (W3C), Lorrie Cranor (AT&T), Philip DesAutels (W3C), Melissa Dunn (Microsoft), Joseph Reagle (W3C).
Finally, Appendix 8 is drawn by the W3C Note "P3P Guiding Principles", whose signatories are: Azer Bestavros (Bowne Internet Solutions), Ann Cavoukian (Information and Privacy Commission Ontario Canada), Lorrie Faith Cranor (AT&T Labs-Research), Josef Dietl (W3C), Daniel Jaye (Engage Technologies), Marit Köhntopp (Land Schleswig-Holstein), Tara Lemmey (Narrowline; TrustE), Steven Lucas (MatchLogic), Massimo Marchiori (W3C/MIT), Dave Marvit (Fujitsu Labs), Maclen Marvit (Narrowline Inc.), Yossi Matias (Tel Aviv University), James S. Miller (MIT), Deirdre Mulligan (Center for Democracy and Technology), Joseph Reagle (W3C), Drummond Reed (Intermind), Lawrence C. Stewart (Open Market, Inc.).
Change log from the 2 November 1999 Specification (last call):