HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire HP Oracle BI Sizing Questionnaire This questionnaire is designed to gather information necessary for Hewlett-Packard to correctly size and configure an Oracle Data Warehouse/Business Intelligence system for a specific user load. Accuracy in the completion of this questionnaire will impact the ability of Hewlett-Packard to respond to your request quickly and correctly. Please ensure all questions are completed as accurately as possible. Please send the completed questionnaire to your HP sales or HP channel partner who has been assigned to help you, or directly to one of the region contacts below: Americas - Americas Solution Design Services Team E-Mail address:
[email protected] EMEA – EMEA Oracle Sizing Team Asia Pacific / Japan Mexico Brasil Other Latin America / Caribbean -
[email protected] -
[email protected] -
[email protected] -
[email protected] -
[email protected] HP endeavors to complete sizings within three (3) business days. A sizing report typically includes a summary of customer requirements, a listing of assumptions and information on the sizing approach taken, and specific recommendations for HP servers and storage. Further information about Hewlett-Packard technologies for Oracle may be obtained from the HP website at www.hp.com/go/oracle and the Oracle-focused sections available on the HP Oracle website at www.hporacle.com. (Version 6.4 May 2010) This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 1 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Customer Information It is often necessary during the sizing of an Oracle Business Intelligence system to contact the customer or reseller for further information. If these details are not completed it could lead to delays in the processing of the sizing request. Without this information no sizing will be performed. Company Name: Company Address: Industry: Contact: Contact Title: Contact Telephone Number: Contact Email Address: Oracle Representative Name: Oracle Phone: Representative Oracle Sales Oracle Reseller Oracle Customer HP Sales or HP Reseller Oracle Rep Email Address: Siebel Opportunity ID: Estimated Purchase Date: Other Information: This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 2 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Products and Implementation Schedule There might be differences between systems configurations needed to support the various versions of Oracle. Please indicate the version number as accurately as possible. Oracle Database Server Version: Oracle Warehouse Builder [Yes/No] Oracle Data Integrator [Yes/No] Partitioning [Yes/No] Oracle Advanced Analytics: OLAP, Data Mining [Yes/No] Oracle End User Access & Reporting Tools: BI Discoverer, BI Publisher [Yes/No] Oracle BI Suite Enterprise Edition [Yes/No] – separate questionnaire required Custom Application: Oracle JDeveloper / ADF / Other [Yes/No] Third Party Applications: [please list] Please provide details of your implementation schedule or phases: Phase Description Pilot Phase Phase 1 Phase 2 Phase 3 Final Phase Named Users per Phase Go Live Date This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 3 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Operating System and Platform Selection If this is an existing Oracle environment, indicate source platform: Reasons for the migration? Existing Infrastructure Type of System Peak number of concurrent users Server Configuration Type Database Server/s: Model (i.e. rp7400, dl580, UE450) Number of CPUs and Clock speed Production _______users Single Node Multi-Node Quantity ________ O/S ____________ Model___________ Processors ______ Test/QA _______users Single Node Multi-Node Quantity _________ O/S ____________ Model___________ Processors ______ Cores___________ Speed __________ RAM ___________ LAN ____________ Development/ Training / Other _______users Single Node Multi-Node Quantity _________ O/S ____________ Model____________ Processors _______ Cores___________ Speed __________ RAM ____________ LAN _____________ Installed Memory LAN card (i.e. 100MB; 1GB, etc) Disk Subsystem Performance during Peak Usage (i.e. poor, ok, good, exceptional) % CPU utilization (Db tier) during peak usage % CPU utilization (Apps Tier) during peak usage % Memory utilization (Db tier) during peak usage % Memory utilization (Apps Tier) during peak usage % Disk utilization (Db tier) during peak usage Oracle Database/s Size (GB) Total Disk Space available (GB) Current Database Version Cores___________ Speed __________ RAM ___________ LAN ____________ Please select the preferred target operating system/hardware platform combination: Operating System Hardware Platform Select One This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 4 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire HP-UX HP Integrity (Itanium) HP Integrity (Itanium) Linux (Red Hat or SUSE) HP ProLiant (Intel or Opteron) 4 socket HP ProLiant 8 socket HP Integrity (Itanium) Microsoft Windows HP ProLiant (Intel or Opteron) 4 socket HP ProLiant 8 socket For HP Integrity or ProLiant servers, would you prefer an HP BladeSystem configuration (if feasible)? (Y/N) Do you require a scale out Oracle RAC cluster solution? OR Hewlett-Packard to provide a platform recommendation based on sizing and availability requirement (“best fit”)? (Y/N) This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 5 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Oracle System Environment The use of several separate Oracle systems in a customer environment is referred to as the “Oracle System Environment”. The role and number of such systems depends upon the size and complexity of the Oracle Business Intelligence implementation. Typical system roles might be: • • • Production System – contains live data to which only the production users have access. Development System - enables ongoing customization of the DW/BI Application and/or development of new components. Test System - enables complete testing of DW/BI Products and Oracle Database upgrades and new software modules, prior to implementation in the production system. This system may also be used to test modifications to system components, such as the operating system, new device drivers, new hardware components, etc. Training System - enables customers to undergo training and become familiar with Oracle DW/BI Products (sandbox functionality) without impacting the development or production systems. • Hewlett-Packard recommends a minimum of four Oracle Systems - separate Oracle systems for Development, Test, Training and Production roles. The roles of any of the Oracle systems can be combined, however, we strongly recommend against this. The Hewlett-Packard Americas Oracle Solution Design Services (SDS) Team will support a minimum of a Two-system environment: Four Oracle System Environment Separate Production, Dev, Test and Training Systems. Highly Recommended! Three Oracle System Environment Separate Production and Test Systems; Combined Dev and Training System. Three Oracle System Environment Separate Production and Dev Systems; Combined Test and Training System. Three Oracle System Environment Separate Production and Training Systems; Combined Dev and Test System. (Y/N) (Y/N) (Y/N) (Y/N) Two Oracle System Environment Separate Prod System; Combined Dev, Test and Training System. Not Recommended! (Y/N) One Oracle System Environment Combined Production, Dev, Test and Training System. Not Supported by Oracle SDS! N/A If detailed requirements for each requested system are known, please complete a separate questionnaire for each and indicate in the “Other Information” field on Page 2 which system the questionnaire pertains to. Otherwise, indicate below the requirements for each additional system in the environment as a percentage of the production system: Performance capacity requirements of additional systems in terms of percentage of Production Development Test Training This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 6 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Oracle Production User Definitions When specifying the number of users planning to access the production system, we observe two categories of user: Named / Licensed Users – the number of users that can potentially log on to Oracle. This is typically specified in the production system’s Oracle software license, and is the maximum number of users that may be defined in the production Oracle system, whether actually logged on or not. Peak Active / Concurrent Users – the peak number of logged-on users that are actively using Oracle. These users interact regularly with the system. Business Intelligence Query Definitions BI queries can be classified into three broad categories: 1) Simple (e.g., How many products were sold in a specific quarter?) 2) Medium (e.g., Which product contributed most to last month’s revenue in a particular region?) 3) Complex (e.g., What is the business trend for specific products for the last three years?) Simple queries look at small amounts of data in the database. Most of the time, they will analyze business information available in summary tables. These queries take full advantage of B-tree and bitmap indexes. Simple queries may also be covered queries where all fields required for the query may be defined in the indices. Medium queries demand more resources from the database. In addition to using indexes and summary tables, Medium queries also join multiple tables, and do calculations and aggregations. Complex queries are the most complex, often involving full-table scans, which create a substantial burden on database resources. Queries that analyze data for business trends require numerous sorting and multiple joining of operations, including calculations such as summations and averages. Since these large queries use such a large portion of system resources, it is important to limit the number of users when executing them. Raw User Data (TB) - Raw data is the actual source user data that is loaded into the database (just the data in the tables). Database size is typically 2-3x the size of the raw data to allow for indexes, summaries and aggregates. Storage space utilized is larger than the database size to allow for logs, temp, scratch space, etc. Not uncommon to see total storage space used of up to 10x raw data at some customers, this can be reduced with database data compression Entry User - Typically run standard / optimized pre-created queries or reports. Intermediate User - Runs a mix of standard reports, but also performs some level of ad-hoc queries and analysis. Does not typically run extremely complex queries that access a large amount of base data, mostly runs analysis against small subsets of data or summaries. Advanced User - Typically runs ad-hoc queries that scans larger portions of the fact tables. Can perform complex joins across large tables. Typically have much lower volume of queries performed, but far more complex and intensive queries. This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 7 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Business Intelligence Architectures Sizing a Business Intelligence System involves knowing the nature and architecture of the system. These architectures have been broadly classified as Enterprise Data Warehouses, Data Marts and Extraction Engines [ETL]. Data warehouse A relational database that is designed for query and analysis rather than transaction processing. A data warehouse usually contains historical data that is derived from transaction data, but it can include data from other sources. It separates analysis workload from transaction workload and enables a business to consolidate data from several sources. In addition to a relational database, a data warehouse environment often consists of an ETL solution, an OLAP engine, client analysis tools, and other applications that manage the process of gathering data and delivering it to business users. Data Mart A data warehouse that is designed for a particular line of business, such as sales, marketing, or finance. In a dependent data mart, the data can be derived from an enterprise-wide data warehouse. In an independent data mart, data can be collected directly from sources. ETL An acronym that stands for Extraction, Transformation, and Load. ETL refers to the methods involved in accessing and manipulating source data and loading it into a data warehouse. The order in which these processes are performed varies. The questions below have been classified into sections based on the above architectures. Please fill in the appropriate sections that apply to your Business Intelligence solution. This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 8 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Enterprise Data Warehouse Requirements Please specify for the Production Data Warehouse system: Estimate the size of the raw (source) data for the Data Warehouse Estimate the size of the Production Data Warehouse database [If unknown, Hewlett-Packard can provide an estimate] Type Fact and Dimension Tables Indexes Temporary Tables Transaction Logs Other – specify Total Size Estimate the rate of your database growth (i.e. 10% per year) Size Estimate the update characteristics of the production database Frequency Expected Load Time Estimate the Query Mix Ratio (% of each type) Definitions of Class Queries are on Page 6 Estimated time [in seconds] an average query should take Simple Medium Complex Simple Medium Complex Estimated sustained I/O throughput of the Data Warehouse (0.1-18 GB/sec) Estimate the number of concurrent users in each Entry Level category (actively doing work) Do not double count same users Intermediate Advanced Estimate the rate of concurrent user base growth (i.e. 5% over 2 years) Please include the Data Warehouse Architecture Diagram/Description and the logical and physical data models as attachments at the end of the document This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 9 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Data Mart Requirements (If there’s a data mart, usually get same type & amount of info as for DW) Please specify for the Production Data Mart system: Estimate the size of the raw (source) data for the Data Mart Estimate the size of the Production Data Mart database [If unknown, Hewlett-Packard can provide an estimate] Type Fact and Dimension Tables Indexes Temporary Tables Transaction Logs Other – specify Total Size Estimate the rate of your database growth (i.e. 10% per year) Size Estimate the update characteristics of the production database Frequency Expected Load Time Estimate the rate of concurrent user base growth Estimate the Query Mix Ratio (% of each type) Definitions of Class Queries are on Page 6 Estimated time [in seconds] an average query should take Simple Medium Complex Simple Medium Complex Estimated sustained I/O throughput of the Data Mart (0.1-18 GB/sec) Estimate the number of concurrent users in each category (actively doing work) Do not double count same users Estimate the number of concurrent users (i.e. 5% over 2 years) Entry Level Intermediate Advanced This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 10 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Indicate what end user query tool would be used, if any Indicate what front end report tool would be used, if any Indicate the planned architecture of the reporting tools [OLAP, ROLAP, MOLAP, HOLAP] Indicate if the above OLAP product be used for SQL reach-through to the application RDBMS Please include the Data Mart Architecture Diagram /Description and the logical and physical data models as attachments at the end of the document. This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 11 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire ETL [Extraction, Transformation and Load] Requirements Please specify for the ETL system: Indicate what ETL Tool will be used What is the source of data extraction [Oracle, DB2, IMS, SQL Server, Flat files etc.]? What is the Hardware platform on which the source databases and applications run on? Size Estimate the data extraction characteristics Frequency Expected Extraction Time Will extracts be required from Legacy system audit logs? Will data be presented to load/cleansing programs using fixed length records, CSV or any other means? Loads Indicate the method by which data will be input into the Data Warehouse Inserts Updates Deletes Size Estimate the data insertion [into Data Warehouse] characteristics Frequency Expected Insertion Time Indicate if the database will be updated in real time using trickle updates Tapes Indicate what data transfer process will be used Network [If so, indicate speed of link] Please include the Extraction Engine Architecture Diagram /Description and the logical and physical data models as attachments at the end of the document This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 12 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Infrastructure Requirements Number and type of network interface cards: If Oracle RAC is required do you want Infiniband connectivity? Should HP propose the network infrastructure (switches, etc.)? (Y/N) Racking and Power Distribution Unit (PDU) requirements: Existing data center UPS? (Y/N) Production Oracle System Availability Requirements The proposed Production solution will include NSPOF (No Single Point of Failure) RAID disk storage subsystems, redundant Network Cards, Power and Fans, UPS power, etc where possible. To further enhance the availability of the Oracle BI production system, Hewlett-Packard can propose a High Availability (HA) Clustered solution. In the event of a production database server failure, the HA solution would provide automated fail-over to another production server, thereby allowing the system to be available within minutes. For HP-UX and Linux the solution would utilize HP Serviceguard. For OpenVMS, the solution would utilize OpenVMS Clustering. For Windows, the solution would utilize Oracle Failsafe and Microsoft Cluster Server. Note! Fail-over is not supported to a development, test or training system. High Availability Clustered Solution required? (Y/N) Will Real Application Clusters (RAC) be used? (Y/N) Available reduced performance level after fail-over (e.g. 50%): Maximum system outage / unplanned downtime: (e.g. 2 hours Monday to Friday between 7am and 6pm) Planned storage RAID architecture [RAID 1, 0+1, 5] Redundant hardware components (e.g. power supplies, I/O cards, etc) (Y/N) Please note! Should your maximum unplanned downtime requirement be less than 5 minutes, the solution will recommend Oracle RAC (Real Application Clusters). Disaster Recovery Requirements (in case of fire, flood, theft, or other major event) Would you like HP to propose a Disaster Recovery option? (Y/N) Maximum allowed recovery time in case of disaster: Is recovery from logical errors required? (Y/N) Please describe your DR requirements. Is there a DR site available? How far away is it (approximately)? How quickly must the DR plan go into effect? Do you require automated site failover? _____ Days _____ Hours Backup/Restore Requirements An on-line backup is made with the Oracle system running and is usually only necessary if user access to the Oracle system is required 24 hours a day: This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 13 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Backup strategy (Enterprise level, etc.): Is an on-line backup of the production system required? (Y/N) Is an operator present to supervise the backup? (Y/N) Available backup window in hours: Maximum restore time in hours: Technical Skills Please provide answers to the following, to allow us to determine the level of required Technical Services: Any on-site Oracle Database Administrator skills for the chosen database version? (Y/N) Any on-site Oracle Technology skills for the chosen Oracle BI Product version? (Y/N) HP-UX only: Any on-site HP-UX System Administrator skills? (Y/N) Linux only: Any on-site Linux Engineer (RHCE, SAIR LCE, etc) skills? (Y/N) MS-Windows only: Any on-site Microsoft MCP or MCSE Server skills? (Y/N) Any on-site Hewlett-Packard API or ASE Systems Engineering skills? (Y/N) Are you an existing user of Hewlett-Packard servers? (Y/N) (If “Yes” please state how many years and which platforms) Who is/are your System Integrator/Implementation Partner(s)? This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 14 of 15 HP-Oracle Data Warehouse & Business Intelligence Sizing Questionnaire Other Requirements Please indicate any other factors that we should take into account for the sizing proposal. If you are using a custom application against an Oracle database, explain how the application works. Next Steps Please forward this document to the appropriate contact person. We shall return a response shortly. Thank you for considering Hewlett-Packard for your Oracle BI implementation. Americas - Americas Solution Design Services Team E-Mail address:
[email protected] EMEA – EMEA Oracle Sizing Team Asia Pacific / Japan Mexico Brasil Other Latin America / Caribbean -
[email protected] -
[email protected] -
[email protected] -
[email protected] -
[email protected] This document contains information protected by copyright, no part may be copied or reproduced in any form without prior written consent from Hewlett-Packard. HP-Oracle DW_BI Questionnaire_v6-4_30-May-2010.doc Page 15 of 15