| |
Path: Home » Publications » Handbook on cultural web user interaction »
Table of contents » Chapter 2 » Chapter 2.6
| |
Handbook on cultural web user interaction
|
2.6 Audience measurement on the InternetThe aim of this section is to review the techniques and metrics used for audience measurement in the Web. By audience measurement we mean the methods used for calculating how many people form part of an audience, that is a group of people reached by a message (television programme, advertising, multimedia content, written texts, etc.). The main aim of audience measurement is therefore to quantify the public, not just in terms of numbers, but also in terms of socio-demographic characteristics (sex, age, study title, geographical area, place of use, taste, behaviour, etc.). On the Internet, audience measurementis carried out for a variety of reasons, among which are: Certain special aspects of the network must however be considered. Within mass communication, the Internet represents a real revolution: in traditional media, communication is from one to many whereas there is a two-way relationship between the user and the web, a constant dialogue between content and navigator. The interface of all the new media (web messenger, chat, blog, etc.) must also be taken into account. These are designed for viewing content and interacting with them in different ways. On the one hand, the user expresses his intentions; on the other, the system, through the aid of technologies in constant evolution, responds to the actions of the user. Many professionals find themselves using methods of measurement. Among these we find producers of online content, computer industry professionals, public administration and e-government operators, webmasters, those in charge of commercial strategies in the network, etc. Techniques for audience measurement change over time, both as a result of technological development and in response to the evolution and use of the network. In order to become familiar with these subjects, below is a list of the terminology currently in use on these themes, which are quite extensive and diversified. Literature does in fact distinguish between: Up to now, the quantitative analysis of the audience, even within the sphere of other media, has used two approaches: Starting from the 1990s, with the new potential of Internet and the web, these two approaches have undergone significant evolution. Automatic data measurement, that is the gathering and analysis of online traffic, is now generically indicated with the term of web analytics, a concept that includes the capacity to record server webs (logging capabilities), the technologies for “tagging” the digital contents (tagging technologies), the possibilities for “sniffing” the traffic in the network (network sniffing), in other words a series of technologies that use different data sources for detecting and analysing the manner in which users are active on the web. On the other hand, gathering data through more traditional investigations (questionnaires, etc.) has in its turn evolved, by taking advantage of the interaction potential of the web. In no way can it be affirmed that one method is better than another; the methodology to be used needs to be considered on a case by case basis, taking into account the information requirements and available resources. One of the fundamental issues is that regarding the process used for data gathering: Of course, the costs associated with measuring and analysing data means that sample data measurements are often preferred to census data measurements. 2.6.1 Census data measurements: web analytics The term web analytics refers to the study of the behaviour of the network users. This system of census data measurement does not entail the direct involvement of the subjects to be measured. A classification of these measurement systems can be made on the basis of the data sources of the behaviour of the users on the web: Various web analytics software and services are available commercially as open source (for example, Google Analytics, Shinystat, AWStats etc.). 2.6.2 Sample or user centred measurements The methodology of user centred measurement derives from “recording the activity of a sample of Internet users who are recruited to be representative of the entire universe of Internet users. The behaviour of these subjects is subsequently projected in order to estimate the behaviour of the entire population of Internet users through opportune statistical projections” (Australian Technical Committee for the Internet Industry). There are two fundamental elements on which a “user centred” measurement system is based: the identification of a sample of Internet users that is representative of the whole, and the actual monitoring of the online behaviour of these individuals. The sample chosen is usually described as a panel (according to Wikipedia, a panel is “the ‘quantity’ chosen on the basis of representative criteria, used for the statistic measurement of a specific universe. It is usually a group of persons or families included in a sample investigation”). So how does one recruit a panel? Generally speaking, research institutes adopt various techniques, among which telephone research with random dialing of numbers or personal interviews with person to person meetings. More recently, recruiting systems have used the post or the web, even if the latter risks skewing results due to the exclusion of non-Internet users. User-centred measurement methodology (by panel) is strongly recommended during the design of websites that are accessible to all user categories, including the most disadvantaged. In-depth For example in Italian national rules: having defined the objectives of the web product and experimented with alternative solutions, you proceed to an evaluation with the client in the use context, to then carry out any corrections and updates, to be submitted to progressive monitoring. “This methodology is based on four main conditions: http://www.pubbliaccesso.gov.it/biblioteca/documentazione/studio_lineeguida/index.htm 2.6.2.1 The meter Generally used in traditional media, the meter is a device for measuring quantities. It is downloaded as software and installed in the computers of the subjects to be monitored. The idea of applying this monitoring technique in the sphere of informatics dates to 1994, when it was used by a group of researchers for measuring the spread and use of application software. This measuring system entails a minimum involvement of the subjects to be analysed, so that in contrast to to server-centred and browser-centred systems (devices-based measurement), aimed at analysing “machine users”, it is effectively “user-centred” (user-centric measurement). It is no longer the machines and their software that is monitored, but the single individuals with their social-demographic and behavioural specificities. Fundamentally there are three processes on which monitoring software by meter placed in the PCs of the panellists is based: univocal identification of the individual that navigates; recording of the information regarding his navigation route; sending of the recordings to the companies who asked for the measurement analyses. Measurements based on panels with a meter are currently the best for gathering data on the navigators’ profiles, the ranking of sites and audience fluctuations among sites (source&loss). Measurement operations by panel and meter are basically the following: The advantages of this type of analysis are: The disadvantages are: 2.6.2.2 Standardized interview – Static textual questionnaire (see also 3.2) The most extensively used method of investigation of media audiences, including those of websites and portals, is a standardised interview. This is done by asking structured questions of all users or of a group of chosen individuals. This measuring system entails the direct involvement of the subjects to be analysed. The aim is to investigate their preferences, habits and behaviour, in order to verify effectiveness in terms of user satisfaction with choices made and to study behaviour during network navigation – in other words, to build a “profile”. The choice of those to be interviewed can be casual or not casual, according to whether or not the choice of those to be interviewed should be probabilistic or not. The following are among the casual methods for choosing a panel: Among the non-casual methods for choosing a panel are: Interviews can be made by telephone or in person, sent by e-mail (e-mail survey), or filled in online using graphical user interface elements such as menus and radio buttons (web interviewing, web-based survey). An online questionnaire can be submitted using the technology offered by the web (at the time of entering or leaving a site, the specific amount of time spent on a site, specific navigation behaviour, every access to the site, etc.), but it is important that a visitor is not constantly exposed to invitations to participate in an investigation. Normally an online questionnaire can be viewed on a full screen or inside various sizes of windows (pop-up). It is formed of a series of questions posed in different ways (open, closed, single, multiple, etc.) and with which the user can interact through interactive graphic solutions (buttons, drop down menus, boxes, advancing arrows, etc.). In order for the interview to be effective and to have high quality results, it is suggested that simple language is used and that a certain amount of care is dedicated to the functionality and aesthetics (look and feel) of the questionnaire. It is advised that the duration of the interview be communicated in advance, and kept to a minimum: we should remember that the users are being asked to dedicate some of their time to helping us! The sequence of the questions should be coherent and dynamic, preventing multiple answers, up to the conclusion of the interview and the final thank-you page. The thanks can be confined to two lines written at the end of the questionnaire, or to an e-mail of reply with a text along the lines of: “Thank you! Your email has been sent to us successfully...” The aim of a pleasant presentation and an effective structure is the achievement of a greater number of completed interviews, minimizing the rate of refusal or incompleteness. The questionnaire can be a chance for winning over a user by registering him with the newsletter or for recompensing him for the energy spent in compiling it, “making him a present” of resources that would usually be subject to payment or reserved (digital and non digital gadgets, subscriptions to exclusive newsletters, the privilege of cooperating in the content creation, etc.). The results can be used through statistical processes or by extracting individual suggestions for changes to the web application. Once the analysis has been completed, it is advised that the results of these surveys (website feedback survey results) be published with an indication of the number of questionnaires analysed and the suggestions that will be implemented or which have already been implemented on the web application. The following are the advantages of this type of technique: What is really critical is the truthfulness of the statements provided by the interviewee regarding his user type, a question that has been rendered even more complicated by the new reality of Internet that witnesses the increase of “virtual beings” (role games, chat, avatar, nicknames, etc), although studies in this sector are not yet consolidated. 2.6.3 Audience metrics Audience metrics is a discipline which originated within the sphere of advertising and marketing. In the web context, its major role is that of providing qualitative and quantitative indicators for the analysis of web application effectiveness. Audiences translated into numbers are defined as ratings. Although traditional media measurement systems are by now standardized (for example, average minutes for TV, average quarter of an hour for press, etc.), this is not the case for the Internet and the web. Let us consider the differences. In traditional media, the relations between the medium (TV, radio, cinema, press etc.) and the public (TV viewers, radio listeners, readers) is identified through measurement of the exposure time, without a close study of the motivations and effects of this exposure. Starting from the 1990s, new metrics (e-metrics, web-metrics, net-ratings) were identified for investigating and quantifying the relationship between Internet users and digital content. The new medium is no longer characterized by a simple “exposure” model, but becomes an “interactive” space of action. The viewer is transformed into an active user. Internet interaction means the use on the web of a space of variable size in which to put data (informative, promotional, advertising, multimedia, etc.) so that the user is not confined to just looking at them but is encouraged to interact by providing a form of reply. This strategy can be defined as a Call to action, in the sense that the contents are placed in the network inviting the user to “do something!” and also specifying “what to do”. Closely linked to this idea there is also the so-called “funnel process”: the user is involved in various processes that bring him from being a simple visitor to becoming involved in more interactive procedures, thus reacting to the communications to which he has been exposed. Of particular importance in the sphere of e-commerce, this process thus makes it possible to verify in real time the effect of a communication on users’ behaviour. There are many forms of interaction: clicking on a banner, filling in a form or questionnaire, making a purchase, downloading applications, participating in a community tool, inserting files, using a content collector, etc.). Within the sphere of exposure metrics, the following are the main indicators, each one of which is in its turn connected to more specific metrics: Within the sphere of interactivity metrics we can distinguish between: Interactivity metrics are used especially in the marketing sector, but they can also be important in the sphere of cultural web applications. The most important indicators are: Obviously, to make a correct analysis of the results, the indicators that analyse the negative results must be considered: Within the sphere of interactions, it is important to understand other concepts: Although they are not studied in depth in this context, we should not forget the indicators of communication costs, or the investments made to generate attention (costs for exhibiting information) or to generate action (costs for enticing interaction). 2.6.4 Log file analysis Servers that host web applications send users textual content, images, multimedia files, etc. In order to increase user satisfaction, their navigation pathways can be carefully monitored and analysed. Standard web server functionality includes the capacity to collect and store detailed information about web server activity. Every provider of server solutions includes his own systems for logging, which collect detailed information on the use of the site. This can then be analysed from various perspectives, in order to extract useful information for various roles (technical, scientific, research, marketing). The interpretation of network traffic makes it possible to extract indicators relevant, for example, to the number of accesses, the navigation routes, behaviour models, technical configuration of the devices used for connecting, etc. The web page requests made by users are stored in the form of log files that record the activities ofteh web server. Due to the considerable size of this type of file, their processing is usually carried out by specific software called log analyzers. These classify logs by type, that is to say access logs (page requests that reach the server with the time and date of the request as well as the IP addresses of the computer that requested the resource and the name of the resource requested by the user); error log (recording of the malfunctions or failures in the handling of resource requests); reference log (recording of the URLs from which the user comes, the search engines used and the key words used); agent log (recording of information regarding the browsers utilised by the end users). The analysis of the log files must however include wide margins of approximation. The most obvious disadvantages are the following: 2.6.5 Protection of privacy Online audience measurement may acquire personal data regarding network users. When individuals are interviewed or metered, they should be warned beforehand of the on-going audiometric procedures so that they are aware how information is being gathered about them. In the case of web analytics techniques, the recording of information is automatic and the individuals “measured” are not made aware of the procedures taking place. In both cases the legislation regarding rights to privacy and protection of personal data must be taken into consideration by any organisation which gathers web usage information. The Directive number 2002/58/CE of the EU regarding the treatment of personal data and protection of privacy in the sector of electronic communications4 foresees that the use of data gathering tools must be made known to the subjects being monitored so that such gathering is transparent. At the same time, the Directive recognises that such tools can be considered legitimate for improving online services. The World Wide Web Consortium (W3C) proposes the use of a standardized solution known as P3P (Platform for Privacy Preferences Project)5, a model that allows websites to state how they mean to use the information gathered from their users. Circulation Audit Board. Diamond, Fran. Web Traffic Analytics Double Click. Internet Audience Dynamics: Esomar |
Copyright Minerva Project 2008-09,
last revision 2008-09-19, edited by Minerva
Editorial Board. |