SQL Tutorial

9. SQL Security
10. Advanced Database Design
11. Working with Large Datasets
12. SQL and NoSQL Integration
13. SQL Best Practices and Common Mistakes
14. SQL Tools and Extensions
15. SQL Certification and Career Path
16. Appendices

9. SQL Security

SQL security is crucial for protecting sensitive data and ensuring the integrity of your database. This section covers key aspects of SQL security, including user management, preventing SQL injection, data encryption, and auditing.

9.1 User Roles and Permissions

Proper user management is the first line of defense in database security. It involves creating users, assigning roles, and granting specific permissions.

Creating Users and Roles

CREATE USER john_doe WITH PASSWORD 'secure_password';
    
    CREATE ROLE read_only;
    
    GRANT read_only TO john_doe;

Granting and Revoking Permissions

GRANT SELECT ON employees TO read_only;
    
    REVOKE INSERT, UPDATE, DELETE ON employees FROM read_only;

Role-Based Access Control (RBAC)

CREATE ROLE hr_manager;
    GRANT SELECT, INSERT, UPDATE ON employees TO hr_manager;
    GRANT SELECT ON payroll TO hr_manager;
    
    CREATE ROLE finance_analyst;
    GRANT SELECT ON financial_reports TO finance_analyst;
    
    CREATE USER jane_smith WITH PASSWORD 'secure_pwd123';
    GRANT hr_manager TO jane_smith;

Principle of Least Privilege

Always grant the minimum necessary permissions to users and roles. Regularly review and audit user permissions to ensure they align with current job responsibilities.

9.2 SQL Injection and Prevention

SQL injection is a technique where malicious SQL statements are inserted into application queries. It's one of the most common web hacking techniques.

Vulnerable Code (Do Not Use)

// This is vulnerable to SQL injection
    string query = "SELECT * FROM users WHERE username = '" + username + "' AND password = '" + password + "'";

Preventing SQL Injection

// Use parameterized queries
    string query = "SELECT * FROM users WHERE username = @Username AND password = @Password";
    command.Parameters.AddWithValue("@Username", username);
    command.Parameters.AddWithValue("@Password", password);

Input Validation and Sanitization

In addition to using parameterized queries, implement strict input validation and sanitization to further protect against SQL injection attempts.

Stored Procedures

Use stored procedures to encapsulate SQL logic and reduce the risk of SQL injection.

CREATE PROCEDURE GetUserByCredentials
        @Username NVARCHAR(50),
        @Password NVARCHAR(50)
    AS
    BEGIN
        SELECT * FROM Users
        WHERE Username = @Username AND Password = @Password
    END
    
    -- Usage
    EXEC GetUserByCredentials @Username = 'john', @Password = 'password123';

9.3 Data Encryption

Encryption is used to protect sensitive data stored in the database. Most modern database systems provide built-in encryption functions.

Transparent Data Encryption (TDE)

TDE encrypts the entire database at rest, including backups and log files.

-- Enable TDE (SQL Server example)
    USE master;
    GO
    CREATE MASTER KEY ENCRYPTION BY PASSWORD = 'StrongPassword123!';
    GO
    CREATE CERTIFICATE TDECert WITH SUBJECT = 'TDE Certificate';
    GO
    USE YourDatabase;
    GO
    CREATE DATABASE ENCRYPTION KEY
    WITH ALGORITHM = AES_256
    ENCRYPTION BY SERVER CERTIFICATE TDECert;
    GO
    ALTER DATABASE YourDatabase
    SET ENCRYPTION ON;
    GO

Column-Level Encryption

Encrypt specific columns containing sensitive data.

-- Create a symmetric key
    CREATE SYMMETRIC KEY SSNKey
    WITH ALGORITHM = AES_256
    ENCRYPTION BY CERTIFICATE TDECert;
    
    -- Encrypt data
    OPEN SYMMETRIC KEY SSNKey
    DECRYPTION BY CERTIFICATE TDECert;
    
    INSERT INTO Employees (Name, EncryptedSSN)
    VALUES ('John Doe', EncryptByKey(Key_GUID('SSNKey'), '123-45-6789'));
    
    -- Decrypt data
    SELECT Name, CONVERT(VARCHAR, DecryptByKey(EncryptedSSN))
    FROM Employees;

Encryption in Transit

Use SSL/TLS to encrypt data in transit between the client and the database server.

9.4 Auditing and Monitoring SQL Activity

Auditing helps track database activities, which is crucial for security and compliance purposes.

Setting Up Auditing (SQL Server Example)

-- Create a server audit
    CREATE SERVER AUDIT MyAudit
    TO FILE (
        FILEPATH = 'C:\SQLAudit\',
        MAXSIZE = 100 MB,
        MAX_ROLLOVER_FILES = 10
    );
    
    -- Create an audit specification
    CREATE SERVER AUDIT SPECIFICATION MyAuditSpec
    FOR SERVER AUDIT MyAudit
    ADD (FAILED_LOGIN_GROUP),
    ADD (SUCCESSFUL_LOGIN_GROUP),
    ADD (DATABASE_CHANGE_GROUP)
    WITH (STATE = ON);
    
    -- Enable the audit
    ALTER SERVER AUDIT MyAudit WITH (STATE = ON);

Querying Audit Logs

SELECT event_time, action_id, succeeded, session_id, server_principal_name
    FROM sys.fn_get_audit_file('C:\SQLAudit\*', NULL, NULL)
    WHERE action_id = 'LGIF' -- Failed logins
    ORDER BY event_time DESC;

Real-time Monitoring

Implement real-time monitoring solutions to detect and alert on suspicious activities.

Log Management

Establish a robust log management system to securely store and analyze audit logs for extended periods.

9.5 Additional Security Best Practices

Regularly update and patch your database management system
Implement strong password policies
Use multi-factor authentication for database access
Segment your network and use firewalls to control database access
Regularly backup your database and test restore procedures
Implement data masking for non-production environments
Conduct regular security audits and penetration testing
Educate your team on SQL security best practices

Summary

Implementing robust SQL security measures is essential for protecting your database and its contents. Key points to remember:

Use the principle of least privilege when granting permissions
Always use parameterized queries to prevent SQL injection
Encrypt sensitive data at rest and in transit
Implement comprehensive auditing to monitor and track database activities
Regularly review and update your security measures
Stay informed about the latest security threats and best practices

10. Advanced Database Design

Advanced database design techniques help create efficient, scalable, and maintainable database structures. This section covers normalization, denormalization, schema design best practices, and referential integrity.

10.1 Normalization (1NF, 2NF, 3NF, BCNF)

Normalization is the process of organizing data to minimize redundancy and dependency. It involves breaking down larger tables into smaller, more manageable ones.

Normal Form	Description
1NF	Eliminate repeating groups, create separate tables for each set of related data
2NF	Meet 1NF requirements and remove partial dependencies
3NF	Meet 2NF requirements and remove transitive dependencies
BCNF	Meet 3NF requirements and remove all determinants that are not candidate keys

Example: Normalizing a Table

Original table (not normalized):

CREATE TABLE orders (
        order_id INT,
        customer_name VARCHAR(100),
        product_name VARCHAR(100),
        quantity INT,
        order_date DATE,
        customer_email VARCHAR(100)
    );

After normalization (3NF):

CREATE TABLE customers (
        customer_id INT PRIMARY KEY,
        customer_name VARCHAR(100),
        customer_email VARCHAR(100)
    );
    
    CREATE TABLE products (
        product_id INT PRIMARY KEY,
        product_name VARCHAR(100)
    );
    
    CREATE TABLE orders (
        order_id INT PRIMARY KEY,
        customer_id INT,
        order_date DATE,
        FOREIGN KEY (customer_id) REFERENCES customers(customer_id)
    );
    
    CREATE TABLE order_items (
        order_id INT,
        product_id INT,
        quantity INT,
        PRIMARY KEY (order_id, product_id),
        FOREIGN KEY (order_id) REFERENCES orders(order_id),
        FOREIGN KEY (product_id) REFERENCES products(product_id)
    );

10.2 Denormalization Techniques

Denormalization is the process of adding redundant data to one or more tables to improve read performance. It's often used in data warehousing and reporting systems.

Example: Denormalizing for Reporting

CREATE TABLE order_summary (
        order_id INT PRIMARY KEY,
        customer_name VARCHAR(100),
        total_amount DECIMAL(10, 2),
        order_date DATE,
        -- Denormalized data
        product_count INT,
        last_product_name VARCHAR(100)
    );
    
    -- Populate the denormalized table
    INSERT INTO order_summary (order_id, customer_name, total_amount, order_date, product_count, last_product_name)
    SELECT 
        o.order_id,
        c.customer_name,
        SUM(oi.quantity * p.price) AS total_amount,
        o.order_date,
        COUNT(DISTINCT oi.product_id) AS product_count,
        FIRST_VALUE(p.product_name) OVER (PARTITION BY o.order_id ORDER BY oi.quantity DESC) AS last_product_name
    FROM orders o
    JOIN customers c ON o.customer_id = c.customer_id
    JOIN order_items oi ON o.order_id = oi.order_id
    JOIN products p ON oi.product_id = p.product_id
    GROUP BY o.order_id, c.customer_name, o.order_date;

10.3 Database Schema Design Best Practices

Use appropriate data types to conserve storage and improve performance
Implement proper indexing for frequently queried columns
Use constraints (e.g., NOT NULL, UNIQUE) to enforce data integrity
Consider future scalability when designing tables and relationships
Use consistent naming conventions for tables, columns, and other database objects

Example: Implementing Best Practices

CREATE TABLE products (
        product_id INT PRIMARY KEY,
        product_name VARCHAR(100) NOT NULL,
        product_code VARCHAR(20) UNIQUE,
        category_id INT,
        price DECIMAL(10, 2) CHECK (price >= 0),
        created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
        updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP,
        FOREIGN KEY (category_id) REFERENCES categories(category_id)
    );
    
    -- Create an index for frequently queried columns
    CREATE INDEX idx_product_name ON products(product_name);
    CREATE INDEX idx_product_category ON products(category_id);

10.4 Referential Integrity and Foreign Keys

Referential integrity ensures that relationships between tables remain consistent. Foreign keys are used to enforce these relationships.

Example: Implementing Referential Integrity

CREATE TABLE orders (
        order_id INT PRIMARY KEY,
        customer_id INT,
        order_date DATE,
        FOREIGN KEY (customer_id) REFERENCES customers(customer_id)
            ON DELETE RESTRICT
            ON UPDATE CASCADE
    );
    
    CREATE TABLE order_items (
        order_id INT,
        product_id INT,
        quantity INT,
        PRIMARY KEY (order_id, product_id),
        FOREIGN KEY (order_id) REFERENCES orders(order_id)
            ON DELETE CASCADE,
        FOREIGN KEY (product_id) REFERENCES products(product_id)
            ON DELETE RESTRICT
    );

Foreign Key Action	Description
RESTRICT	Prevents the deletion of a parent row if it has related child rows
CASCADE	Automatically deletes or updates related child rows when a parent row is deleted or updated
SET NULL	Sets the foreign key column(s) to NULL when the parent row is deleted or updated
NO ACTION	Similar to RESTRICT, but checked at the end of the statement

Summary

Advanced database design techniques help create efficient and maintainable database structures. Key points to remember:

Use normalization to reduce data redundancy and improve data integrity
Apply denormalization techniques when appropriate for performance optimization
Follow best practices in schema design, including proper data types and indexing
Implement referential integrity using foreign keys to maintain data consistency
Regularly review and optimize your database design as requirements evolve

11. Working with Large Datasets

As databases grow, managing and querying large datasets becomes challenging. This section covers techniques for handling big data efficiently.

11.1 Partitioning Tables

Partitioning divides large tables into smaller, more manageable parts based on a partition key.

Example: Range Partitioning

CREATE TABLE sales (
        sale_id INT,
        sale_date DATE,
        amount DECIMAL(10, 2)
    )
    PARTITION BY RANGE (YEAR(sale_date)) (
        PARTITION p0 VALUES LESS THAN (2020),
        PARTITION p1 VALUES LESS THAN (2021),
        PARTITION p2 VALUES LESS THAN (2022),
        PARTITION p3 VALUES LESS THAN MAXVALUE
    );

11.2 Sharding Databases

Sharding distributes data across multiple database servers, improving performance and scalability.

Example: Sharding Strategy

-- Shard 1: Customers with IDs 1-1000000
    CREATE TABLE customers_shard1 (
        customer_id INT PRIMARY KEY,
        name VARCHAR(100),
        CHECK (customer_id BETWEEN 1 AND 1000000)
    );
    
    -- Shard 2: Customers with IDs 1000001-2000000
    CREATE TABLE customers_shard2 (
        customer_id INT PRIMARY KEY,
        name VARCHAR(100),
        CHECK (customer_id BETWEEN 1000001 AND 2000000)
    );

11.3 Data Warehousing Concepts

Data warehousing involves storing large amounts of historical data for analysis and reporting.

Example: Star Schema

CREATE TABLE fact_sales (
        sale_id INT PRIMARY KEY,
        date_key INT,
        product_key INT,
        customer_key INT,
        quantity INT,
        amount DECIMAL(10, 2),
        FOREIGN KEY (date_key) REFERENCES dim_date(date_key),
        FOREIGN KEY (product_key) REFERENCES dim_product(product_key),
        FOREIGN KEY (customer_key) REFERENCES dim_customer(customer_key)
    );

11.4 ETL Processes

ETL (Extract, Transform, Load) processes are used to move data from operational systems to data warehouses.

Example: Simple ETL Process

-- Extract
    CREATE TEMPORARY TABLE temp_sales AS
    SELECT * FROM source_system.sales
    WHERE sale_date >= CURRENT_DATE - INTERVAL 1 DAY;
    
    -- Transform
    UPDATE temp_sales
    SET amount = amount * 1.1
    WHERE product_category = 'Electronics';
    
    -- Load
    INSERT INTO data_warehouse.fact_sales
    SELECT * FROM temp_sales;

11.5 Working with Big Data in SQL

Big Data often requires specialized tools and techniques beyond traditional SQL databases.

Example: Using SQL with Hadoop

-- Using Hive SQL to query Hadoop data
    SELECT 
        year, 
        COUNT(*) AS total_sales,
        SUM(amount) AS total_revenue
    FROM sales_data
    WHERE year >= 2020
    GROUP BY year
    HAVING total_revenue > 1000000
    ORDER BY year;

Summary

Working with large datasets requires specialized techniques:

Use partitioning to divide large tables into manageable chunks
Implement sharding for horizontal scaling across multiple servers
Design data warehouses for efficient analysis and reporting
Develop ETL processes to move and transform data
Utilize big data technologies when traditional SQL databases are insufficient

12. SQL and NoSQL Integration

As data needs evolve, many organizations use both SQL and NoSQL databases. This section explores NoSQL concepts and how they integrate with SQL databases.

12.1 Understanding NoSQL Databases

NoSQL databases provide flexible schemas and scalability for certain types of data and use cases.

Common NoSQL Database Types

Type	Description	Example
Document	Stores data in JSON-like documents	MongoDB
Key-Value	Simple key-value data storage	Redis
Column-family	Stores data in column families	Cassandra
Graph	Stores data in nodes and edges	Neo4j

12.2 SQL vs. NoSQL

Comparing SQL and NoSQL databases helps in choosing the right tool for specific use cases.

Comparison Table

Feature	SQL	NoSQL
Schema	Fixed	Flexible
Scalability	Vertical	Horizontal
ACID Compliance	Strong	Varies
Query Language	SQL	Database-specific

12.3 Using SQL with NoSQL Databases

Some NoSQL databases offer SQL-like query languages or interfaces for easier integration.

Example: SQL-like Queries in MongoDB

// MongoDB query using SQL-like syntax
    db.runCommand({
      find: "customers",
      filter: { age: { $gt: 25 } },
      sort: { name: 1 },
      limit: 10
    })
    
    // Equivalent SQL query
    SELECT * 
    FROM customers
    WHERE age > 25
    ORDER BY name
    LIMIT 10;

12.4 Polyglot Persistence

Polyglot persistence involves using multiple database types to leverage their strengths for different data needs.

Example: Polyglot Persistence Architecture

// User profiles in SQL database
    CREATE TABLE users (
        user_id INT PRIMARY KEY,
        username VARCHAR(50),
        email VARCHAR(100)
    );
    
    // User sessions in Redis (NoSQL key-value store)
    SET session:1234 "{\"user_id\": 1, \"last_access\": \"2023-04-15T10:30:00Z\"}"
    
    // User activity in MongoDB (NoSQL document store)
    db.user_activity.insert({
        user_id: 1,
        activity_type: "login",
        timestamp: ISODate("2023-04-15T10:30:00Z")
    })
    
    // Social connections in Neo4j (NoSQL graph database)
    CREATE (u1:User {id: 1})-[:FOLLOWS]->(u2:User {id: 2})

Summary

Integrating SQL and NoSQL databases can provide powerful solutions:

Understand the strengths and weaknesses of different NoSQL database types
Compare SQL and NoSQL features to choose the right tool for each use case
Utilize SQL-like interfaces in NoSQL databases when available
Implement polyglot persistence to leverage the strengths of multiple database types
Consider data consistency and integration challenges when using multiple database systems

13. SQL Best Practices and Common Mistakes

13.1 Writing Readable and Maintainable SQL Code

SELECT
        u.user_id,
        u.username,
        COUNT(o.order_id) AS total_orders,
        SUM(o.total_amount) AS total_spent
    FROM
        users u
    LEFT JOIN
        orders o ON u.user_id = o.user_id
    WHERE
        u.status = 'active'
    GROUP BY
        u.user_id, u.username
    HAVING
        total_orders > 5
    ORDER BY
        total_spent DESC
    LIMIT 10;

13.2 Avoiding Common SQL Pitfalls

-- Avoid using SELECT *
    SELECT id, name, email FROM users;
    
    -- Use proper indexing
    CREATE INDEX idx_user_email ON users(email);
    
    -- Avoid using NOT IN with subqueries
    SELECT * FROM orders
    WHERE user_id NOT IN (SELECT user_id FROM banned_users);
    
    -- Use NOT EXISTS instead
    SELECT * FROM orders o
    WHERE NOT EXISTS (
        SELECT 1 FROM banned_users b
        WHERE b.user_id = o.user_id
    );

13.3 Performance Tuning Best Practices

-- Use EXPLAIN to analyze query performance
    EXPLAIN SELECT * FROM users WHERE email = 'user@example.com';
    
    -- Use appropriate data types
    CREATE TABLE products (
        id INT,
        name VARCHAR(100),
        price DECIMAL(10, 2)
    );
    
    -- Use JOINs instead of subqueries when possible
    SELECT u.username, o.order_date
    FROM users u
    JOIN orders o ON u.id = o.user_id
    WHERE o.total_amount > 100;

14. SQL Tools and Extensions

14.1 SQL Client Tools (e.g., MySQL Workbench, pgAdmin)

SQL client tools provide graphical interfaces for database management and querying.

14.2 SQL Extensions (e.g., PL/SQL, T-SQL)

-- PL/SQL example
    CREATE OR REPLACE PROCEDURE update_product_price(
        p_product_id IN NUMBER,
        p_new_price IN NUMBER
    )
    IS
    BEGIN
        UPDATE products
        SET price = p_new_price
        WHERE product_id = p_product_id;
        
        COMMIT;
    EXCEPTION
        WHEN OTHERS THEN
            ROLLBACK;
            RAISE;
    END;

14.3 Using ORM with SQL (e.g., Hibernate, SQLAlchemy)

// Java Hibernate example
    @Entity
    @Table(name = "users")
    public class User {
        @Id
        @GeneratedValue(strategy = GenerationType.IDENTITY)
        private Long id;
    
        @Column(name = "username")
        private String username;
    
        @Column(name = "email")
        private String email;
    
        // Getters and setters
    }

14.4 SQL in Data Science (e.g., Integration with R, Python)

# Python with SQLAlchemy example
    from sqlalchemy import create_engine
    import pandas as pd
    
    engine = create_engine('postgresql://user:password@localhost:5432/mydatabase')
    
    df = pd.read_sql_query('SELECT * FROM users WHERE age > 30', engine)
    print(df.head())

14.5 SQL and Cloud Databases (e.g., AWS RDS, Azure SQL Database)

-- AWS RDS example
    CREATE TABLE users (
        id INT AUTO_INCREMENT PRIMARY KEY,
        username VARCHAR(50) NOT NULL,
        email VARCHAR(100) UNIQUE NOT NULL,
        created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
    );
    
    -- Azure SQL Database example
    CREATE TABLE products (
        id INT IDENTITY(1,1) PRIMARY KEY,
        name NVARCHAR(100) NOT NULL,
        price DECIMAL(10, 2) NOT NULL
    );

15. SQL Certification and Career Path

15.1 SQL Certification Paths

15.2 Preparing for SQL Certification Exams

15.3 Career Opportunities in SQL and Database Management

Summary

Pursuing SQL certifications and building a career in database management:

Choose a certification path aligned with your career goals and preferred database system
Utilize online resources and practice exams to prepare for certification tests
Explore various career opportunities in the field of database management and data analysis
Continuously update your skills to stay relevant in the evolving database landscape
Consider specializing in specific areas such as data warehousing, big data, or cloud databases

16. Appendices

16.1 SQL Cheat Sheet

Basic Queries

Concept	Example
SELECT	`SELECT name, age FROM users WHERE age > 18;`
INSERT	`INSERT INTO users (name, email) VALUES ('John', 'john@example.com');`
UPDATE	`UPDATE products SET price = 19.99 WHERE id = 1;`
DELETE	`DELETE FROM orders WHERE status = 'cancelled';`

Joins

Join Type	Example
INNER JOIN	`SELECT * FROM orders o INNER JOIN customers c ON o.customer_id = c.id;`
LEFT JOIN	`SELECT * FROM employees e LEFT JOIN departments d ON e.dept_id = d.id;`
RIGHT JOIN	`SELECT * FROM orders o RIGHT JOIN products p ON o.product_id = p.id;`
FULL OUTER JOIN	`SELECT * FROM students s FULL OUTER JOIN classes c ON s.class_id = c.id;`

Aggregations and Grouping

Concept Example

GROUP BY

Concept	Example
GROUP BY	`SELECT country, COUNT(*) FROM customers GROUP BY country;`
HAVING	`SELECT product_id, AVG(price) FROM sales GROUP BY product_id HAVING AVG(price) > 100;`
Aggregate Functions	`SELECT COUNT(*) as total_orders, SUM(amount) as total_sales, AVG(amount) as avg_order_value, MAX(amount) as largest_order, MIN(amount) as smallest_order FROM orders;`

SELECT country, COUNT(*) FROM customers GROUP BY country;

HAVING

SELECT product_id, AVG(price)
        FROM sales
        GROUP BY product_id
        HAVING AVG(price) > 100;

Aggregate Functions

SELECT 
            COUNT(*) as total_orders,
            SUM(amount) as total_sales,
            AVG(amount) as avg_order_value,
            MAX(amount) as largest_order,
            MIN(amount) as smallest_order
        FROM orders;

Subqueries

Concept Example

Subquery in WHERE

Concept	Example
Subquery in WHERE	`SELECT name FROM products WHERE category_id IN ( SELECT id FROM categories WHERE name = 'Electronics' );`
Correlated Subquery	`SELECT e.name FROM employees e WHERE salary > ( SELECT AVG(salary) FROM employees WHERE department_id = e.department_id );`

SELECT name FROM products
        WHERE category_id IN (
            SELECT id FROM categories WHERE name = 'Electronics'
        );

Correlated Subquery

SELECT e.name
        FROM employees e
        WHERE salary > (
            SELECT AVG(salary) FROM employees
            WHERE department_id = e.department_id
        );

Set Operations

Operation Example

UNION

Operation	Example
UNION	`SELECT name FROM customers UNION SELECT name FROM employees;`
INTERSECT	`SELECT product_id FROM order_items INTERSECT SELECT product_id FROM inventory;`
EXCEPT	`SELECT customer_id FROM all_customers EXCEPT SELECT customer_id FROM active_customers;`

SELECT name FROM customers
        UNION
        SELECT name FROM employees;

INTERSECT

SELECT product_id FROM order_items
        INTERSECT
        SELECT product_id FROM inventory;

EXCEPT

SELECT customer_id FROM all_customers
        EXCEPT
        SELECT customer_id FROM active_customers;

Advanced Concepts

Concept Example

Window Functions

Concept	Example
Window Functions	`SELECT name, department, salary, RANK() OVER (PARTITION BY department ORDER BY salary DESC) as rank FROM employees;`
Common Table Expressions (CTE)	`WITH high_value_orders AS ( SELECT * FROM orders WHERE total_amount > 1000 ) SELECT customer_id, COUNT(*) as order_count FROM high_value_orders GROUP BY customer_id;`
PIVOT	`SELECT * FROM ( SELECT category, sales_amount FROM sales ) AS SourceTable PIVOT ( SUM(sales_amount) FOR category IN ([Electronics], [Clothing], [Books]) ) AS PivotTable;`

SELECT 
            name,
            department,
            salary,
            RANK() OVER (PARTITION BY department ORDER BY salary DESC) as rank
        FROM employees;

Common Table Expressions (CTE)

WITH high_value_orders AS (
            SELECT * FROM orders WHERE total_amount > 1000
        )
        SELECT customer_id, COUNT(*) as order_count
        FROM high_value_orders
        GROUP BY customer_id;

PIVOT

SELECT *
        FROM (
            SELECT category, sales_amount
            FROM sales
        ) AS SourceTable
        PIVOT (
            SUM(sales_amount)
            FOR category IN ([Electronics], [Clothing], [Books])
        ) AS PivotTable;

Data Definition Language (DDL)

Concept	Example
CREATE TABLE	`CREATE TABLE users ( id INT PRIMARY KEY, name VARCHAR(50), email VARCHAR(100) UNIQUE, created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP );`
ALTER TABLE	`ALTER TABLE users ADD COLUMN age INT;`
CREATE INDEX	`CREATE INDEX idx_lastname ON employees (last_name);`
CREATE VIEW	`CREATE VIEW active_users AS SELECT * FROM users WHERE status = 'active';`

16.2 Common SQL Error Codes and Solutions

Database	Error Code	Description	Possible Solution
MySQL	1054	Unknown column	Check column name and table structure
MySQL	1064	Syntax error	Review SQL syntax and fix any mistakes
MySQL	1146	Table doesn't exist	Verify table name and database selection
MySQL	1451	Foreign key constraint fails	Check related tables for conflicting data
PostgreSQL	42P01	Undefined table	Verify table name and schema
PostgreSQL	42703	Undefined column	Check column name in the table
PostgreSQL	23505	Unique violation	Ensure unique constraint is not violated
SQL Server	208	Invalid object name	Check table or view name
SQL Server	245	Conversion failed	Ensure data types are compatible
SQL Server	547	Constraint violation	Check foreign key or check constraints
Oracle	ORA-00001	Unique constraint violated	Ensure unique key is not duplicated
Oracle	ORA-00904	Invalid identifier	Check column or table name
Oracle	ORA-01017	Invalid username/password	Verify login credentials
SQLite	1	SQL logic error	Check SQL syntax
SQLite	8	Read-only database	Ensure write permissions
SQLite	19	Constraint failed	Check constraints on the table

16.3 Additional Resources and Further Reading

Comments & Discussion

Facing issues? Have questions? Post them here! We're happy to help!

Provide Feedback For This Article

We take your feedback seriously and use it to improve our content. Thank you for helping us serve you better!

Thanks for your feedback! If you have time, please provide details by selecting options below.

😊 Thanks for your time, your feedback has been registered!

SQL Tutorial Part 2

SQL Tutorial

Table of Contents

9. SQL Security

9.1 User Roles and Permissions

Creating Users and Roles

Granting and Revoking Permissions

Role-Based Access Control (RBAC)

Principle of Least Privilege

9.2 SQL Injection and Prevention

Vulnerable Code (Do Not Use)

Preventing SQL Injection

Input Validation and Sanitization

Stored Procedures

9.3 Data Encryption

Transparent Data Encryption (TDE)

Column-Level Encryption

Encryption in Transit

9.4 Auditing and Monitoring SQL Activity

Setting Up Auditing (SQL Server Example)

Querying Audit Logs

Real-time Monitoring

Log Management

9.5 Additional Security Best Practices

Summary

10. Advanced Database Design

10.1 Normalization (1NF, 2NF, 3NF, BCNF)

Example: Normalizing a Table

10.2 Denormalization Techniques

Example: Denormalizing for Reporting

10.3 Database Schema Design Best Practices

Example: Implementing Best Practices

10.4 Referential Integrity and Foreign Keys

Example: Implementing Referential Integrity

Summary

11. Working with Large Datasets

11.1 Partitioning Tables

Example: Range Partitioning

11.2 Sharding Databases

Example: Sharding Strategy

11.3 Data Warehousing Concepts

Example: Star Schema

11.4 ETL Processes

Example: Simple ETL Process

11.5 Working with Big Data in SQL

Example: Using SQL with Hadoop

Summary

12. SQL and NoSQL Integration

12.1 Understanding NoSQL Databases

Common NoSQL Database Types

12.2 SQL vs. NoSQL

Comparison Table

12.3 Using SQL with NoSQL Databases

Example: SQL-like Queries in MongoDB

12.4 Polyglot Persistence

Example: Polyglot Persistence Architecture

Summary

13. SQL Best Practices and Common Mistakes

13.1 Writing Readable and Maintainable SQL Code

13.2 Avoiding Common SQL Pitfalls

13.3 Performance Tuning Best Practices

14. SQL Tools and Extensions

14.1 SQL Client Tools (e.g., MySQL Workbench, pgAdmin)

14.2 SQL Extensions (e.g., PL/SQL, T-SQL)

14.3 Using ORM with SQL (e.g., Hibernate, SQLAlchemy)

14.4 SQL in Data Science (e.g., Integration with R, Python)

14.5 SQL and Cloud Databases (e.g., AWS RDS, Azure SQL Database)

15. SQL Certification and Career Path

15.1 SQL Certification Paths

15.2 Preparing for SQL Certification Exams

15.3 Career Opportunities in SQL and Database Management

Summary

16. Appendices

16.1 SQL Cheat Sheet

Basic Queries

Joins

Aggregations and Grouping

Subqueries

Set Operations

Advanced Concepts