Previous | Table of Contents | Next |
Now is the time to take a look at the different methods for linking together databases from many vendors. In general, there are three approaches to achieving these links:
We will be exploring the following issues:
You will find two schools of thought about having applications that span across multiple vendor databases. One group maintains that a single vendor should be able to provide a global, omniscient database that is capable of supporting all types of data requirementsfrom online transaction processing to decision support. Many vendors foster this belief by marketing their database products as suitable for every type of application, pointing out the nightmares that result when a multidatabase application attempts to make diverse databases communicate with each other in a seamless fashion. The opposing philosophy believes that the only way to successfully implement database technology is to use numerous databases, leveraging the strengths of each engine. As recently as five years ago, database vendors touted their products as a corporate panacea, appropriate for all types of data and application systems.
All of this began to change around the time when IBM conceded that its DB2 database was not appropriate for high-volume systems. Prior to this, DB2 was marketed as an all-purpose engine that was suitable for any application, regardless of the size or performance requirements. With time, it became clear that DB2 was not appropriate for systems that required thousands of transactions per second, so IBM began recommending IMS for very large, high-speed databases.
The whole multidatabase trend is a reaction to corporate acquisitions and weak development. Companies have had little or no strategic direction for databases, although it is doubtful that any company ever decided to go heterogeneous, either. Heterogeneous databases mean duplicating licenses and talent, and few companies have a large enough talent pool to support multiple databases. The difference between the engines is too minimal.
From this beginning, other vendors began to embrace the concept of the right database for the right application. Systems that required flexibility and ad-hoc query were implemented with relational databases, while systems with complex data relationships and high performance requirements were implemented with network or hierarchical database architectures. New types of specialized database products began appearing on the scene: special word-searchable databases for textual data, databases designed especially for CAD/CAM systems, and so on. Database systems began their long march away from corporate repositories into specialized niche markets, some of which are shown in Table 9.1.
Type of Database | Product |
---|---|
Text Databases | Oracle Book, Basis Plus, Folio, Fulcrum |
High-Speed Databases | IDMS, IMS, Teradata |
Object Databases | Objectivity/DB, Ontos, Versant |
Warehouse Databases | Red Brick Warehouse, MetaCube, Fusion |
Originally, the desire for a central repository of data was driven by the need to impose some control over data redundancy and multiple updates to the same information placed on various file systems. From this situation evolved our centralized systems, using a single-vendor database. However, end-user computingand to some degree, slow data transmissionhelped to justify to the corporate world a move to client/server platforms. Unfortunately, there has been no standard model for client/server data management. In spite of all the open systems standards (OSF, POSIX, X-Windows, IEEE, and so on), nothing truly solid has emerged for databases. Furthermore, the standards that we do havesuch as SQLare implemented in very different ways. While real-world developers wait anxiously for standards to emerge, they simply make their own decisions on the best DBMS platform. Other departments in the corporate world have followed the same thinking, choosing their idea of the best DBMS, and so on. The results of this scattered approach are islands of information distributed among our LANs and WANsnot only distributed among different file systems on the same box, but distributed across the country and around the world. Linking these databases has become a very complicated but critical chore.
External influences have also impacted the database configuration of shops. Corporate acquisition, mergers, and right-sizing efforts have all had the side effect of leaving a plethora of database engines within the newly reformed organization. Regardless of the wisdom of having multiple types of databases, most IS departments must link more than one database, and cross-database connectivity has become commonplace. Sentry Market Research estimates that the average corporation has approximately nine different databases. End users are demanding applications that integrate legacy data from mainframes with data in open systems, while MIS designers are faced with the incredible challenge of making these diverse systems function as a unified whole.
While linking nonrelational databases such as IMS and CA-IDMS into relational databases is considered challenging, it is a common misconception that relational databases are basically the same and therefore easy to link together.
Previous | Table of Contents | Next |