Business intelligence (BI) is difficult to define because it is not a single technology. Rather, it is a collection of methodologies that are grouped together solely because they are all designed to help business people extract usable information from raw data.
There is a core set of technologies that are widely accepted as being part of BI: data warehousing, online analytical processing (Olap), and data mining. Quite rightly, these get the majority of the coverage when BI is discussed. However, there are several lesser known technologies that also deserve to be discussed and perhaps included under the BI umbrella. This briefing paper will consider just two, chosen to demonstrate the broad range of software that is currently available.
Query response
Operational data is almost always held in relational databases  these provide efficient data storage, high data integrity and allow questions to be asked of the data. The bad news is that they have a very poor query response when used with large sets of data. However, Olap can be used to speed up data queries.
Olap is an established approach to BI and is usually based on several methodologies and technologies. Data is first structured in a star schema with what are known as facts, dimensions and hierarchies. Facts are the data to be analysed, such as number of items sold, payment methods and so on. Typical dimensions might be time, customer, region and product. Each dimension is often arranged as a hierarchy, so time might be arranged as days, weeks, months, quarters and years.
Users can then ask questions such as, 'How many chairs did we sell in the first quarter of this year to male customers in Leeds?' and expect an almost instant reply. Olap provides such a rapid response by the simple expedient of calculating the answer to all the possible questions that can be asked. In other words it calculates all of the intersections of all the dimensions during the initial building of the data set  which is known as an Olap cube. But although Olap cubes provide astonishing speed, they have several disadvantages.
For a start, given, say, 10 dimensions (not an unusual number) each with 1,000 values, the number of intersections that can be calculated is 1029. Olap software usually has mechanisms to reduce this, but cubes are still massive. The number of possible intersections also means that they can take a long time to build. However, the main disadvantage of Olap goes right back to that star schema and those dimensions. At the point when cube designers choose the dimensions and hierarchies, they are limiting the questions that can be asked of that particular cube. For example, if a monthly hierarchy is not included, the user cannot get monthly totals of product sales. The cube can, of course, be rebuilt to include monthly totals, but that takes time.
This is where Alterian, a UK-based company, comes in. Alterian has attempted to combine the speed of Olap with the flexibility of a relational database. It provides a database engine that leaves the data structured as relational tables, which instantly cures Olap's problems  because the problems are an inevitable consequence of Olap structuring. However, the reason that Olap was developed in the first place was that relational tables are notoriously slow to query; queries can take hours or days to run. The Alterian solution overcomes this difficulty by making significant changes to the way in which the data is compressed and indexed.
Consider a 10-million row table with a yes/no field in which the majority of responses are yes. Traditional databases store them all: the Alterian engine can note the default value and then store only the exceptions. The engine has a host of different compression techniques available to it that it can apply to different data types. It draws from this pool of techniques and applies any or all that are appropriate. In fact, the choice of a given technique may depend not just upon the data type but also on the distribution of the data within a given field.
The tables are very heavily indexed and, similarly, the Alterian database engine can draw upon a variety of different indexing techniques. Indeed, some fields may end up indexed multiple times, depending on the structure of the table and its relationship with other tables.
It is worth noting that although this database is structured as a relational database, this does not mean that it is operational. In other words, it is designed to be used as a read-only database rather than as one where users are allowed to constantly update the data it contains.
Alterian really does seem to let users have their cake and eat it. The database remains structured in a very flexible way, allowing any and all queries to be run. The data volume is trivial compared to Olap, as is the load rate of more than 10GB per hour. The company says it offers query speeds in excess of 50 million rows per second using a desktop PC, while servers typically provide faster performance. So users should be able to obtain the query performance of Olap without the disadvantages.











