Piwik: The Best Alternative to Urchin! Web Analytics via Log File Import

Google will discontinue Urchin development and support as of March 2012. Urchin was log analysis software bought by Google in 2005. Google used it as the base for Google Analytics but has now announced that it will focus exclusively on Google Analytics. We have since received a number of emails from Urchin users asking whether Matomo (Piwik) could be set up to carry out log analysis in the same way as Urchin, and to import all past logs into a Matomo server.

We are happy to say that we have been developing a powerful, simple-to-use script that analyses your web server log files (Apache, Nginx, IIS, Akamai, etc.) and imports visits and page views into Matomo.

We hope that over the next few months, Matomo will become the best alternative to Urchin and AWStats (and others).

! UPDATE 2012, March 20th !

We have now released the beta version of the script to import and analyze server logs using Matomo.

Find all of the documentation and details on the Server Log Analytics page.

Matomo Features When Used to Import Log Files

Matomo normally uses JavaScript code to track visits and pages. This new script also makes it easy to track visits by importing one or many web server log files into Matomo. Here are some examples of when you might want to use the script:

  • if you are unable to add the JS code to your websites
  • if you wish to import a large amount of historical data at once
  • if you are looking for software that does the same thing as Urchin, AWStats, Webalizer or Webtrends.

Some features of the Matomo log import script include:

  • Great performance: we have successfully tested it tracking several million log lines per day. See the Matomo for high traffic websites checklist.
  • Bot traffic is automatically excluded: this keeps your web analytics reports clean and useful, and improves performance.
  • Track using more than one method: Matomo can track some websites with the standard JavaScript code, and other websites could be tracked by importing the access logs. You could, for example, use JavaScript tagging for websites 1 and 3, and log import for sites 2 and 4. We expect these hybrid Matomo servers to become a common configuration among the community.
  • File downloads appearing in the logs will be automatically tracked as “Downloads” in Matomo
  • Access to all Matomo features: because logs are imported via the Tracking API, all Matomo features will be supported (Goal tracking based on URL, IP Anonymization, Visitor log, etc.)
  • This script will effectively replace Apache2Piwik, as the new tool provides more features and better performance.
  • In later versions we are planning to support log reprocessing, error code tracking, search engine & spam Bot tracking, features to use the logs to enhance existing JS tracked pages, and more (based on user popularity and feedback).
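
To make the features above more concrete, here is a minimal sketch of how a single access log line could be turned into a Tracking API request. It is not the import script itself: the regular expression assumes the Apache/Nginx "combined" log format, the bot markers and download extensions are illustrative lists we made up for this example, and the function names are hypothetical. The request parameters (idsite, rec, url, download, cip, cdt, ua, urlref, token_auth) are standard Tracking API parameters.

```python
import re
from datetime import datetime, timezone
from urllib.parse import urlencode

# Assumption: logs use the Apache/Nginx "combined" format.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referrer>[^"]*)" "(?P<user_agent>[^"]*)"'
)

# Illustrative values only; the real script's rules are not described here.
BOT_MARKERS = ("bot", "crawler", "spider")
DOWNLOAD_EXTENSIONS = (".pdf", ".zip", ".tar.gz", ".exe")


def parse_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


def is_bot(hit):
    """Guess from the user agent whether the request came from a bot."""
    user_agent = hit["user_agent"].lower()
    return any(marker in user_agent for marker in BOT_MARKERS)


def is_download(hit):
    """Treat requests for well-known file extensions as downloads."""
    path = hit["path"].split("?", 1)[0].lower()
    return path.endswith(DOWNLOAD_EXTENSIONS)


def tracking_params(hit, host, site_id, token_auth):
    """Build the Tracking API query parameters for one parsed log line."""
    when = datetime.strptime(hit["time"], "%d/%b/%Y:%H:%M:%S %z")
    url = "http://" + host + hit["path"]
    params = {
        "idsite": site_id,
        "rec": 1,
        "url": url,
        "cip": hit["ip"],  # overriding the IP requires token_auth
        "cdt": when.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M:%S"),
        "ua": hit["user_agent"],
        "urlref": "" if hit["referrer"] == "-" else hit["referrer"],
        "token_auth": token_auth,
    }
    if is_download(hit):
        params["download"] = url
    return params


line = ('203.0.113.7 - - [20/Mar/2012:10:15:32 +0100] '
        '"GET /files/report.pdf?v=2 HTTP/1.1" 200 5120 "-" "Mozilla/5.0"')
hit = parse_line(line)
if hit and not is_bot(hit):
    query = urlencode(tracking_params(hit, "example.org", 1, "YOUR_TOKEN"))
    print("piwik.php?" + query)
```

The sketch only builds the query string and does not send it. A real import would also need to batch requests and handle other log formats, which is where most of the performance work lies.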

Note: Some reports will have no data because log data is more limited than the data obtained via JavaScript. For example, screen resolutions, supported browser plugins, Custom Variables and Ecommerce Analytics will not work.

This script was written in Python and is released under the GPL license, for free (just like Matomo!)

Perfect for Web Hosting Companies and Web Agencies, and for One-Off Log Imports

The script will have 2 modes:

  1. Web Host – web analytics provider user
    This mode is ideal for web hosts, where new websites are often added in the access logs, but the Matomo admin does not wish to manually create each website. The script will automatically detect the Matomo website ID to track based on the URL being parsed: it will look for any Matomo website registered with a URL or “Alias URL” set to this page view host. If a website with the hostname doesn’t exist, a new website is automatically created for this URL.
    A summary is then emailed to the Matomo Super User so they know which websites were automatically created by the log import script and can create users or assign permissions to view these new websites.
  2. Simple log import for one or a few websites only
    This mode is ideal if you import only a small number of websites or if you wish to control exactly in which websites requests are to be tracked.
    When a log line contains a URL for a website that is unknown to Matomo, the page view is ignored, and at the end of the script execution Matomo reports the list of hostnames that were not matched to any website.
    If these unknown URLs turn out to be legitimate page views, you can either create a new website manually, or add an Alias URL to an existing website, so the page URLs are directly tracked in this website the next time you import similar logs.
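
The difference between the two modes can be sketched as follows. This is a simplified illustration, not the script's actual code: the SiteResolver class, its attribute names and the in-memory dictionary are all hypothetical stand-ins for the lookup of website URLs and Alias URLs in Matomo, and site creation is simulated by handing out the next free ID.

```python
class SiteResolver:
    """Map the hostname of a log line to a website ID (hypothetical helper)."""

    def __init__(self, known_sites, auto_create=False):
        # known_sites maps each hostname (main URL or Alias URL) to a website ID.
        self.sites = dict(known_sites)
        self.auto_create = auto_create   # True = web host mode
        self.created = []                # hostnames created during this run
        self.unmatched = set()           # hostnames ignored during this run
        self._next_id = max(self.sites.values(), default=0) + 1

    def resolve(self, hostname):
        """Return the website ID for hostname, or None if the hit is ignored."""
        hostname = hostname.lower()
        site_id = self.sites.get(hostname)
        if site_id is not None:
            return site_id
        if self.auto_create:
            # Web host mode: register a new website for this hostname.
            site_id = self._next_id
            self._next_id += 1
            self.sites[hostname] = site_id
            self.created.append(hostname)
            return site_id
        # Simple mode: skip the hit and remember the hostname for the report.
        self.unmatched.add(hostname)
        return None


known = {"example.org": 1, "www.example.org": 1}

web_host = SiteResolver(known, auto_create=True)
web_host.resolve("shop.example.net")
print("Created:", web_host.created)

simple = SiteResolver(known)
simple.resolve("shop.example.net")
print("Not matched:", sorted(simple.unmatched))
```

In web host mode the created list is what would feed the summary email to the Super User. In simple mode the unmatched set corresponds to the hostname report printed at the end of the run.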

Join the Beta Testing Group

To be part of our beta testing group, please email us at hello@piwik.org and mention the testing of the Urchin/Awstats log import script. Please also mention the number of websites you wish to track, how many pages per day, and if you are willing to test the script and report bugs or feedback.

Featured Sponsor: a Web Host that Tracks Millions of Log Lines with Matomo

This work is sponsored by Alwaysdata, a French web hosting company. They provide Matomo as their web analytics package of choice, replacing AWStats, for thousands of their users. They have been using Matomo for a few years, and we are finally integrating this key log import feature into Matomo, as well as ensuring good performance for the script. We want to make it easy for web hosts and large web agencies to use Matomo as their web analytics platform.

Goodbye Urchin + Scale of Google Analytics in 2012

The Google Analytics team have decided to focus on the privately hosted Google Analytics (GA) service and discontinue the log analysis version (Urchin). At Matomo we are quite simply amazed at the scale and reach of Google Analytics in 2012: GA is used by over 55% of all internet websites (source). At least 15 million websites use Google Analytics! (source). In comparison Matomo is used by 1% of the Internet (cheers!) and 250k+ websites.

Millions of pings (page views) are tracked by GA per SECOND. This is enough to make any software developer speechless. We can only congratulate Google engineers and product designers for the work they are doing to track and aggregate so much data, while allowing users to slice this data in real time across dozens of dimensions. This is an amazing technical milestone. We also hope that Google users’ privacy will be respected and privacy standards will improve in the future.

Regarding the end of Urchin, we at Matomo will do our best to give existing Urchin users a good experience when they move to Matomo to try the leading free software platform. If you are an Urchin user and would like to try Matomo, send us an email at hello@piwik.org describing your current setup. We will help and check whether similar functionality is feasible with Matomo and the log import script.

Privacy & Security implications of self hosting your web analytics data

Ensuring full control over your customers’ log files and your Matomo database is an important requirement if you are a web agency or a web host providing web analytics to hundreds or thousands of users.

The tips on the Privacy page will help you make the changes to data collection and data retention required by your Privacy Policy. We also focus on code security and recommend that all Matomo users spend some time securing their Matomo server.

Matomo Also an Alternative to AWStats & Webalizer: Modern UI, Better Performance, and More!

We hope that Matomo will become the leading alternative to Urchin and to AWStats. AWStats was a great tool but we hope to modernize the log analysis open source software world and make use of all the great Matomo features and capabilities in terms of data analysis and graphing. Users in 2012 and beyond will need a modern interface to access the data gathered from their web server access logs.

We expect a release in 1-2 months. Stay tuned…

Happy log file importing with Matomo!
