This paper examines database systems and design principles through the lens of a consultant project for an international electronics retailer. The analysis covers the implementation of relational database concepts including primary keys, foreign keys, and referential integrity across multiple organizational departments. The paper demonstrates how these foundational database design principles support data consistency and operational efficiency. Additionally, it explores the emerging role of big data in modern business analytics and decision-making, examining how organizations leverage vast data volumes from e-commerce transactions and social media to improve marketing strategies and predict customer behavior.
This paper presents a consultant's analysis of database systems and design principles for an international organization that sells high-end electronics. The organization operates across multiple departments, including payroll, human resources, finance, marketing, sales, and operations. Each department maintains distinct data requirements and compensation structures. The sales department is unique in that its employees receive commission payments in addition to their yearly salary and benefits, calculated by multiplying the employee's commission rate by the total amount of products sold. All other departments compensate employees with a yearly salary and benefits only. This assignment involves designing a relational database that accommodates these departmental differences while maintaining data integrity and consistency across the organization.
The foundation of relational database design rests on the implementation of primary and foreign keys. A primary key is an attribute or combination of attributes that uniquely identifies each tuple (row) within a relation (table). For example, in a vehicle registration database, the primary key could be either a license plate number or a combination of the owner's name and address. The primary key ensures that no two rows in a table are identical and provides a reliable reference point for other tables.
Relations can be connected to each other with the help of relationships. To establish these connections, a reference to the primary key of one relation is used in another table. This reference is called a foreign key. A foreign key in one table points to a primary key in another table, creating a logical link between related datasets. For instance, an invoice table might contain an employee ID as a foreign key, which references the employee ID primary key in the employee table.
Referential integrity is a fundamental constraint that ensures the validity of foreign key relationships. It is achieved when all entries in a foreign key column refer to valid primary key values in the referenced table, or contain null values. In an employee database, referential integrity means that each employee number in the payroll table exists as a primary key in the employee table or is null. When referential integrity is properly enforced, related datasets are modified, updated, and deleted together as a unit.
The consequences of failing to maintain referential integrity can be serious. According to Teradata (2003, p. 129), a critical problem occurs when an employee is deleted from the employee database without the associated record being deleted from the payroll database. This orphaned record could result in someone no longer employed still being paid by the organization. By enforcing referential integrity constraints, database administrators prevent such inconsistencies and ensure that the organization's data remains accurate and reliable across all related tables.
The database schema for this organization establishes relationships between multiple tables through ID numbers and entities representing the things, people, and events in the business process. The employee table maintains relationships with the invoice table because employees are responsible for sales transactions. Additionally, the employee table connects to the invoice line table, as these employees are selling products and generating individual line items on invoices.
The invoice table maintains relationships with both the product table and the employee table, since each invoice records what is being sold and who is selling it. These relationships create a network of data dependencies that reflect the organization's operational structure. When SQL queries are constructed to retrieve business information, these relationships must be properly joined to produce accurate results.
For example, a query to calculate commissions paid to sales employees for a specific period would need to join the employee table (for employee names and salary information), the invoice line table (for quantity sold), and potentially the product table (for pricing). A sample SQL statement to calculate December commissions might select employee names, yearly salary, quantity sold, commission rates, and apply formulas to compute total compensation including commissions. While such queries can become complex when multiple tables are involved, they reflect the underlying schema relationships and demonstrate how relational databases support business analytics and reporting.
Beyond traditional relational databases, big data represents a new era that enables companies to capture and analyze vast volumes of information to maintain competitive advantage. Big data can come in many forms. For example, data can consist of transactions from online stores. Online shopping has grown significantly in recent years as consumers increasingly find it convenient to purchase products digitally. When transactions occur, companies collect detailed logs of this data to improve their marketing and sales strategies. These logs help predict buying patterns, identify the age demographics of buyers, and determine optimal timing for product promotions.
According to Martin Courtney, there are several key characteristics of big data: high volume, high variety, high velocity, and high veracity. Many organizations beyond e-commerce leverage large data volumes. Social networking sites such as Facebook, Twitter, and YouTube generate enormous amounts of data daily. On Facebook, users share audio, video, and photos with their networks. To maximize user engagement, these companies continuously update their platforms to encourage interaction and community participation. As data continuously changes, software developers must create new methods to support hardware innovations and handle evolving data types and formats.
The availability of vast amounts of data improves organizational decision-making. More information becomes accessible at the click of a mouse, enabling companies to conduct experiments and form hypotheses about future trends and customer preferences. However, managing big data presents significant challenges. Organizations must address bandwidth limitations, as different users and systems operate at varying speeds, which can affect overall data processing performance. Larger platforms have developed batch processing jobs to reduce data size and optimize workflow across applications without creating bottlenecks. Another critical concern is data privacy and security, ensuring that sensitive data does not leave its primary server and remains protected from unauthorized access.
This analysis has examined the fundamental principles of relational database design and their application to organizational data management. The implementation of primary keys, foreign keys, and referential integrity constraints provides the structural foundation that ensures data accuracy and consistency. These concepts are not merely academic—they directly prevent operational failures such as orphaned records and duplicate payments. As organizations grow and data volumes increase exponentially, the foundational skills in database design become even more critical. Understanding both traditional relational database principles and emerging big data concepts equips professionals to design systems that support reliable business operations while enabling insight from massive, complex datasets.
You’re 98% through this paper. Sign up to read the full paper.
Sign Up Now — Instant Access Already a member? Log inAlways verify citation format against your institution’s current style guide requirements.