About the program
This program is a 32-hour Snowflake developer program, which will help you pick up Snowflake a cloud Data warehouse tool – one of the most in demand modern data warehouse tool. This program is brought to you by Agile Testing Alliance, a global leader in bringing cutting edge upskilling programs for technology professionals.
Program duration – 32 Hours
Total Days 16 ( 2 hours per day) – 8 PM IST to 10 PM IST
Starting Date: 19th June 2023
Cost: 32000 + Taxes (This fee includes a course completion certificate from Agile Testing Alliance)
This course introduces participants to the challenges of traditional Data warehouse and helps them understands why there is a need of a cloud-based Data Warehouse and to Snowflake.
Participants are then introduced to snowflake Architecture and its advanced features like time travel, zero-copy clone, data share, and scheduling utilities like tasks along with a strong emphasis on performance optimization. The course emphasizes practical
implementation.
With the case studies, we will explain in a real-life project scenario, how Snowflake as a modern Data warehouse is implemented at an enterprise level.
Why Snowflake
Data warehouse is the core of any organization, as it is important for regulatory reporting and enables business to take key decisions.
With the sheer size of data warehouse, it is a challenge to scale up the compute and storage capacity of data warehouse. Data storage and compute resource in most of the Data Warehouse are tightly coupled which is also a disadvantage. Hence, there is need to move our Data Warehouse to such a framework wherein compute and storage capacity are decoupled and can be increased or decreased on demand.
Modern Cloud Data Warehouse solutions like snowflake meets the given requirement and offers great flexibility along with other unique features like Time Travel, Zero copy clone, Data Share and Data Market place.
Due to data security concerns, there is slight reluctance in moving the data from on premise to cloud. In Snowflake, data is encrypted in each layers at all times. The concept of customized encryption allows to bring your own keys and encrypt the data which further enhances the data security.
In a nutshell, Snowflakes clicks most of boxes for modern Data Warehousing needs and the program clicks all the boxes for someone who wants to move to the most in demand data warehouse tool in today’s world.
Pre-requisites
1. Good understanding of database concepts.
2. Need to have an experience/ exposure in Software Development life cycle.
3. Interest to learn new and cutting edge technology
If you are already into Data Base and/or Data Warehouse technology, this would immediately help you leapfrog into one of the most in-demand areas
Hardware / Software Pre-requisites
Participants should preferably carry their own laptops (BYOD) with at least 4GB RAM and good internet bandwidth. The program would be delivered virtually and would require you to attend regularly.
Course Handouts
Participants will be given course notes and lab exercise handouts
Table of contents / Learning Objectives
Introductions
Introduce participants and understand specific learning objectives, if any, from the participants.
Cloud Data Warehousing
1.1 Definition
Explain the concept of cloud data warehousing and how it differs from traditional on-premise data warehousing.
1.2 Features
Cover the benefits of cloud data warehousing, including scalability, flexibility, and cost-effectiveness.
Introduction to Snowflake
Provide an overview of Snowflake and its capabilities. Cover the basics of how Snowflake works, including its cloud-based architecture, available cloud platforms, available regions, types of available Snowflake editions.
2.1 Cloud Platforms
2.2 Cloud Regions
2.3 Editions
2.4 Releases
2.5 Eco System
2.6 Architecture
Describe the key components of Snowflake and provide an overview of Snowflake's cloud architecture.
2.7 Creation of a trial account
Show how to create a trial account to enable participants to have their own Snowflake trial account to perform the labs.
2.8 Tools and utilities
Introduce SnowSQL, WebUI, SnowSight, and discuss supported drivers.
Snowflake Architecture
Get deeper into Snowflake architecture. Cover the key components of Snowflake's cloud-based architecture, including its separation of storage and compute, virtual warehouses, and the role of metadata.
3.1 Compute Layer
Provide details on what the compute layer is, the types of available virtual warehouses, cost, auto-scaling, multi-cluster warehouses, and resource monitors.
3.2 Storage Layer
Provide details on what the storage layer is, the data storage process, scaling, cost, columnar storage, micro-partitioning, metadata, clustering, column, and micro-partition pruning.
3.3 Cloud Services Layer
Provide details on what the cloud services layer is, services managed in the cloud services layer, authentication, metadata management, access control, and caching.
Snowflake Objects
4.1 Account, Databases, and Schemas
4.2 Tables (and Data Types), Views, Materialized Views
4.3 Stored Procedures, UDF
4.4 Stages, File Formats, Pipes, Streams, Tasks
4.5 Other Objects
Data Ingestion
5.1 Bulk Loading
Discuss how to load large amounts of data quickly into Snowflake, copy options, loading semi-structured data, best practices, and examples.
5.2 Continuous Loading
Discuss how to load streaming data into Snowflake, best practices, and examples.
5.3 Alternatives to loading.
Data Unloading
Discuss how to unload data from Snowflake into multiple platforms.
Snowflake Features
7.1 Time Travel
7.2 Fail-Safe
7.3 Zero-Copy Cloning
7.4 Data Sharing
7.5 Data Marketplace
7.6 Data Replication
Snowflake SQL.
Discuss the SQL supported by Snowflake and some functions that can make our job easier.
UDF and Stored Procedures
Discuss UDF, stored procedures (Snowflake scripting) with exception handling, and transactions.
Streams and Tasks
Discuss change data capture by using streams and scheduling with tasks. Make a continuous data pipeline using Snowpipe, stream, procedure, and task.
Security and Compliance
Discuss Snowflake's security features, including encryption, access controls, and audit trails. Cover how Snowflake complies with various data security and privacy regulations.
Performance Tuning and the cost
Discussion on choosing right warehouse, Query Profile, Pruning and Result Scan. General guidelines on how to identify a bad performing query, queueing and improve the performance. Discussion on saving the cost.
Data Governance:
• Object Tagging
• Data Classification
• Masking Policies
• Row Access Policies
• Access History
• Object Dependencies
Alerts and Notifications
• Snowflake Alerts
• Email Notifications
Business Use Cases
• Real-world Use Cases: Provide real-world use cases and examples of how Snowflake has been implemented in various organizations across different industries.