Advantages of using a Data Warehouse
The data warehouse is essential in all the organization since it is used in the storage of data; however it is type of relation database which is purposely designed for querying and analysis of the organizational data which had been stored for a relative long period of time.
The data ware house also uses some techniques to collect and manage the data that is retrieved from various sources in order to provide some insight to the business owners that they use in their decision making.
However the data warehouse do store the data in an electronic form and this information is then easily queried and analyzed by the relevant people though the data transformation processes to a relevant information which is then later shared to the targeted users within the set time to enable them improve in their businesses (Abraham, 2013).
The data ware house is also referred to as the decision support system and essentially the records in its tables are always maintained in a separate way from the normal organizations database.
Therefore the data warehouse is used mainly for the query and data analysis purpose since it is faster in response to any query or the reports generation process.
Below are some of the advantages of using the data ware house:
- It enables the business people to have access of some critical data from various sources as a single unit of information that is easy to use in decision making.
- It is able to provide information that is highly consistent from various sources.
- It easily does integration of multiple data sources to minimize the production problems (Raghu 2015).
- It highly minimized the time take to analyze data and produce report.
- The data warehouse enables users to have access of very critical data from multiple sources into one source.
- It is a best tool to store large historic data and thus users easily analyze the trend and can easily predict the future business out come.
This report involves designing and implementation of data warehouse of a business company known as “DIGITAL CLOTHS LTD” and mainly deals with production and sales of the cloths items which includes shoes, shirts, jackets, trousers, ties, caps and socks.
This company had transacted over a long period of time and the management was intending to analyze their business trends and essentially know the products that are highly sold over a period of time, the analysis will help the management to decide on the relevant change that they will add to their products or the products they will eliminate in their production process in order to boost their profit levels.
In order to come up with the final data ware house facts which will be analyzed, the research had to be conducted on the company’s business historical information where the research involved some systematic processes in order to gather information on the various products that the company sales and the respective duration they had been in the business.
The research will therefore enable the correction of the company’s business facts and thus will enable in solving the existing business challenges.
Research and Analysis of Findings
The core reason of this research is to determine the most selling product and the least selling product so that the management can decide on the product to maximize on their business transaction
In our research the qualitative research method was used since some data was to be picked from the database in a random manner; however this method used the relevant tools for random sampling and structured collection.
This method was used since it is easy to use and is more appropriate while summarizing, comparing and generalizing the results obtained and doing the hypothesis test.
In this research the data sample used was relatively small which included 28 sales that were made within a span of four days, this sample size was selected since it will be more appropriate and could give more accurate analysis results.
The data to be analyzed was corrected at a random from various organization sources where various techniques were used as below.
- Using interviews.
This process involved the management team where through the interview and discussion some information was corrected and was to be used in the analysis process.
- Using company secondary data from databases.
In this case the organization gave an access to their databases where some sample data was corrected and after the data cleaning process the data was analyzed.
- Using observations.
The observation was another technique that was used where through the observation data was picked and recorded awaiting the cleaning and analysis process.
SALES MADE |
|||||||
PRODUCTID |
PRODUCT_NAME |
DESCRIPTION |
UNITCOST |
UNITS |
TOTAL_COST |
CUSTOMER_NAME |
SALES DATE |
1 |
SHOES |
leather made |
100 |
2 |
200 |
PETER JAMES |
5/14/2018 |
1 |
SHOES |
leather made |
100 |
3 |
300 |
ALEX TITUS |
5/15/2018 |
1 |
SHOES |
leather made |
100 |
1 |
100 |
ELIZABETH N |
5/18/2018 |
2 |
SHIRTS |
cotton made |
120 |
1 |
120 |
PAUL ISSAAC |
5/14/2018 |
2 |
SHIRTS |
cotton made |
120 |
1 |
120 |
JANE JULIETA |
5/16/2018 |
3 |
JACKETS |
leather material s |
50 |
1 |
50 |
SAM PETERSON |
5/14/2018 |
4 |
TROUSERS |
cotton material made |
250 |
1 |
250 |
PETER JAMES |
5/14/2018 |
4 |
TROUSERS |
cotton material made |
250 |
1 |
250 |
ALEX TITUS |
5/14/2018 |
4 |
TROUSERS |
cotton material made |
250 |
1 |
250 |
PAUL ISSAAC |
5/15/2018 |
4 |
TROUSERS |
cotton material made |
250 |
1 |
250 |
ELIZABETH N |
5/16/2018 |
4 |
TROUSERS |
cotton material made |
250 |
2 |
500 |
JANE JULIETA |
5/17/2018 |
4 |
TROUSERS |
cotton material made |
250 |
3 |
750 |
SAM PETERSON |
5/14/2018 |
4 |
TROUSERS |
cotton material made |
250 |
1 |
250 |
CYRUS P |
5/15/2018 |
5 |
TIES |
khaki-cotton material |
50 |
1 |
50 |
ANN LUCY |
5/14/2018 |
5 |
TIES |
khaki-cotton material |
50 |
1 |
50 |
CYRUS P |
5/15/2018 |
5 |
TIES |
khaki-cotton material |
50 |
2 |
100 |
PAUL ISSAAC |
5/16/2018 |
5 |
TIES |
khaki-cotton material |
50 |
1 |
50 |
SAM PETERSON |
5/18/2018 |
6 |
CAPS |
jean material made |
100 |
1 |
100 |
PETER JAMES |
5/14/2018 |
6 |
CAPS |
jean material made |
100 |
1 |
100 |
PAUL ISSAAC |
5/15/2018 |
6 |
CAPS |
jean material made |
100 |
1 |
100 |
CYRUS P |
5/15/2018 |
6 |
CAPS |
jean material made |
100 |
1 |
100 |
JANE JULIETA |
5/16/2018 |
6 |
CAPS |
jean material made |
100 |
1 |
100 |
ELIZABETH N |
5/18/2018 |
7 |
SOCKS |
cotton wool made |
30 |
1 |
30 |
AGNES JYPTIAN |
5/14/2018 |
7 |
SOCKS |
cotton wool made |
30 |
1 |
30 |
PAUL ISSAAC |
5/14/2018 |
7 |
SOCKS |
cotton wool made |
30 |
1 |
30 |
CYRUS P |
5/15/2018 |
7 |
SOCKS |
cotton wool made |
30 |
1 |
30 |
ELIZABETH N |
5/16/2018 |
7 |
SOCKS |
cotton wool made |
30 |
2 |
60 |
SAM PETERSON |
5/17/2018 |
7 |
SOCKS |
cotton wool made |
30 |
1 |
30 |
JANE JULIETA |
5/18/2018 |
The corrected data was then analyzed in order to prove the users’ hypothesis wrong or right where it was done using the Ms Excel software and the data corrected was merged in a single table and various attributes were selected to do the analysis and plotting of graphs as below.
- Summarized data sample.
Count of UNITS |
Column Labels |
|||||
Row Labels |
5/14/2018 |
5/15/2018 |
5/16/2018 |
5/17/2018 |
5/18/2018 |
Grand Total |
CAPS |
1 |
2 |
1 |
1 |
5 |
|
JACKETS |
1 |
1 |
||||
SHIRTS |
1 |
1 |
2 |
|||
SHOES |
1 |
1 |
1 |
3 |
||
SOCKS |
2 |
1 |
1 |
1 |
1 |
6 |
TIES |
1 |
1 |
1 |
1 |
4 |
|
TROUSERS |
3 |
2 |
1 |
1 |
7 |
|
Grand Total |
10 |
7 |
5 |
2 |
4 |
28 |
- Average, maximum and minimum sales per day and per product.
Count of UNITS |
Column Labels |
||||||||
Row Labels |
5/14/2018 |
5/15/2018 |
5/16/2018 |
5/17/2018 |
5/18/2018 |
Grand Total |
Average Total |
Maximum Sale |
Minimum Sale |
CAPS |
1 |
2 |
1 |
1 |
5 |
2 |
5 |
1 |
|
JACKETS |
1 |
1 |
1 |
1 |
1 |
||||
SHIRTS |
1 |
1 |
2 |
1 |
2 |
1 |
|||
SHOES |
1 |
1 |
1 |
3 |
2 |
3 |
1 |
||
SOCKS |
2 |
1 |
1 |
1 |
1 |
6 |
2 |
6 |
1 |
TIES |
1 |
1 |
1 |
1 |
4 |
2 |
4 |
1 |
|
TROUSERS |
3 |
2 |
1 |
1 |
7 |
3 |
7 |
1 |
|
Grand Total |
10 |
7 |
5 |
2 |
4 |
28 |
|||
Average Total |
1 |
1 |
1 |
1 |
1 |
4 |
|||
Maximum Sale |
3 |
2 |
1 |
1 |
1 |
7 |
|||
Minimum Sale |
1 |
1 |
1 |
1 |
1 |
1 |
After the analysis process of the sample data that was corrected from the organization there are various observations that were that were obtained regarding the various products that the company produce and sales over a period of four days as summarized in the table below.
Product name |
Total sales |
Percentage sales |
Caps |
5 |
17.9% |
Jackets |
1 |
3.6% |
Shirts |
2 |
7.1% |
Shoes |
3 |
10.7% |
Socks |
6 |
21.4% |
Ties |
4 |
14.3% |
Trousers |
7 |
25% |
Therefore from the above sales summary table the highest selling product is trousers, socks, caps, ties, shoes, shirts and jackets in that chronological order, however it is a clear indication that the jackets are least bought items and therefore the company requires to eliminate them or do more marketing on the product for better yields.
The system prototype was developed using Ms Access database system where the various components were implemented to enable easy operation of the system .Some of the items integrated in the system included the tables, queries, reports, forms and the switch board which was used to link all the forms.
Below is the implementation process that was followed:
In the data warehouse the dimensions are the structures that are used in the categorization of the facts and the respective measures which are in form of attributes and they are mainly used in the process of making decisions in the organization.
In this system there are four main dimensions which includes the customers, production, and staffs where each dimension is used in storing some specific types of measures.
Below is the structure of the data warehouse dimensions that was used.
The above figure shows the structure of the dimension tables that are associated with various facts, below is the various dimension table structures and their creation in ms access.
In the ware house implementation apart from the dimensions there were the facts where these facts are in the fact table where some facts are the attributes that references the dimension tables and the other are the primary keys.
Below is the list of the facts in the data warehouse.
- SALEID
- PRODUCTID
- CUSTOMERID
- STAFFID
- UNITS
- SALESDATE
- Table attributes.
In this case the queries are designed to enable the retrieval of the sales information from all the tables that are linked to each other. The data extracted is then analyzed to be used in the organization decision making process.
Below are the queries that are created to implement the data warehouse.
SALES MADE QUERY
- The query structure.
Reference
Abraham, S. (2013) Database System Concepts .2nd edn.Michigan:McGraw-Hill.
Bipin, D.(2014) An Introduction to Database Systems.4th edn. Boston: Addison-Wessley Publication.
Raghu, R. (2015) Database Management Systems.2ndedn.New York: McGraw-Hill Higher Education.
Ullman, D.(2016) Principles of Database Systems.3rd edn. Berlin:Springer publishers.
Victor,V.(2013) Foundations of Databases.1st edn. New York: Pearson.