Back to Dot-Com Builder How-Tos Archive
Things to Consider When Selecting a Web Traffic Analysis Tool
If you have a Web site to build or manage, you have an audience to cater to. How are you going to measure their activity, and how will you know if you've succeeded? How much traffic analysis is required on your site? Thomas Jensen, Audience Analyst for Sun, offers advice on selecting and implementing a traffic analysis tool. Traffic analysis solutions range from expensive, complex, enterprise-level software applications with database back ends to freeware tools that generate HTML reports from raw Web server logs. Another fundamental distinction concerns data collection methods, i.e., whether data is captured from Web server log files or by sniffing packets on the network. Before You Buy: Things to Know
1) Find out your hardware requirements. And with free Web traffic analysis tools, remember that only the software is free. Enterprise-scale sites have to devote enterprise-scale hardware and disk space to traffic analysis.
2) Review the purpose and business goals of your site. Involve the decision makers in your organization, and find out what kind of decisions they make on a daily/weekly basis. Instead of asking them about "data," ask questions like, "What do you base your decisions on?" or "What are we trying to accomplish with this site?" Then decide what traffic analysis reports are needed.
3) Appoint an administrator for the project.
4) Be aware of the limitations of traffic analysis packages. Cookies might be the answer, but check your privacy policy. And don't even think of pushing cookies to people unless you give them something in return, Jensen advises. It's simply a matter of courtesy. Supermarkets don't stick a bar code on shoppers' clothes when they enter, but they might give discounts to registered, card-carrying customers.
5) Don't expect true numbers from traffic analysis tools. More significantly, a good amount of traffic to your site (easily one-third of the total) does not come from the real flesh-and-blood users you are interested in analyzing. The most notable sources of such "noise" traffic are search-engine spiders, but many other automated clients are also out there. If you have anything besides unrestricted, plain HTML pages on your site, these clients are prone to breaking and getting stuck in request loops. This traffic not only inflates your tracking total, but can also skew it severely. Jensen has yet to discover a tool that identifies such unwanted (for stats purposes) clients by their behavior, rather than their IP address or user agent. "Common sense in interpreting the numbers still goes a longer way than relentless pursuit of the perfect configuration," he says.
6) Decide what to count and what to filter out. To filter with log-parsing tools, you can presort or preprocess logs using a Perl script or simply the 'grep' command. If you use a packet sniffer, you rely on the software for all filtering. Criteria for Selection
1) The most obvious -- reliability: With a log-based tool, the Web server logs can be reprocessed/reloaded -- but how lengthy and arduous is the recovery process? Packet-sniffing traffic analysis tools are responsible for the data capture, so at least that function must have 100% availability. Also, tools must be able to store data, if it cannot be written to the database at a given time.
2) User interface and usability:
3) Ease of configuring reports: Also, is it necessary to specify all the variables every time you select a report? This may be too challenging for certain users. How does the report track visitors, and can you select different options? For example, will the package count all visitors, or only external visitors?
4) Ad hoc reporting: Ad hoc reporting may not be available for smaller, cheaper tools, but you can write log preprocessing scripts in order to meet some of your reporting needs.
5) Flexibility: Useful Optional Features Some features may not be standard in all packages but can be useful, depending on your needs. Examples include: Ability to aggregate content in reports, instead of just reporting by individual URLs. Another method is to truncate URLs in order to get directory-level reports, but this solution sacrifices the drill-down capability. Good click-stream analysis, such as a report on the top 10 paths through the site or incoming and outgoing clicks to and from a single page. Note that calculating click-stream reports can be quite resource intensive, so ask for only what you really need. Report user management by a designated superuser or administrator. For example, users can get accounts with an inbox to regularly receive relevant reports. Advice on Implementation Before you implement a new tool, think carefully about picking "ground zero." Choose a meaningful release date for the new reports, such as the beginning of a fiscal year or a fiscal quarter, and make sure it's attainable. Find out what information you already have. Do you have old Web server logs or reports? Think about how you will compare data over time. You want to ensure comparability over time -- CEOs probably want to see growth rates, not page-view counts. Any time you change to a new tool, it invariably means a shift in your measurement methodology, so you need to know how the new numbers relate to the old. Don't just give people the stats they're already getting. For example, people may be accustomed to reports counting hits or total users, and they may be making inferences about usage and popularity that would be more appropriately based on page views or daily unique visitors. Start a new methodology to discover your staff's needs, and then configure the reports to meet those needs. The real value of these tools is not in measuring traffic but in providing people with information. As Jensen says, "It's easy enough to implement your Web stats software and walk away, but that's not doing your company any favors."
|
| ||