Usage Data Collection- Policies, Technologies, etc.

Some notes on groups that do data collection in order to have a better idea of the size/character of their users.

Mozilla

https://wiki.mozilla.org/Browser_Metrics

http://www.mozilla.com/en-US/legal/privacy/firefox-en.html (in particular, ‘automated update services’) opt-out; all data is gathered purely from information in http headers in updates, so no extra tech built in.

Wordpress

http://ma.tt/2008/07/26-by-the-numbers/ (look for ‘update system’; can’t find other details)

CiviCRM

http://civicrm.org/node/274 (current system)

http://civicrm.org/node/430 (proposed collection for next release)

no privacy policy/data retention policy yet, but do md5 the URL of data submitters. Do not discard IP numbers from that log file; feel they probably should.

found it useful; twice as many Joomla-based users as expected

opt-out.

Eclipse

data heavy- more to get a sense of what features people are using, rather than how many users there are. opt-in as a result of privacy concerns; over 30K opt-in users.

Web site for UDC: http://www.eclipse.org/epp/usagedata/index.php

Terms of use: http://www.eclipse.org/org/usagedata/terms.php

Two communications regarding UDC:

http://dev.eclipse.org/blogs/mike/2008/06/05/collecting-usage-data/

http://dev.eclipse.org/blogs/mike/2008/06/06/using-usage-data/