Books, Courses & Certifications

13 Top Generative AI Courses

Published

7 months ago

January 24, 2025

Onome

Generative AI is surely one of the most exciting AI advancements. With its growing influence, the demand for knowledge in this area has skyrocketed. If you’re looking to get into this fascinating field, taking a generative AI course is a great start.

This article will walk you through everything you need to know about generative AI courses, helping you choose the best one to match your needs and goals.

What Is Generative AI?

Generative AI refers to algorithms, like neural networks, that can generate new content based on the data they’ve been trained on. Unlike traditional AI, which is designed to recognize patterns and make predictions, generative AI creates something new.

From generating realistic images to composing music and writing text, the applications are vast and varied.

Why Learn Generative AI?

The benefits of learning generative AI are immense. Not only can it open up numerous career opportunities, but it also allows you to be at the forefront of technological innovation.

Are you an artist looking to explore new mediums? A developer aiming to create cutting-edge applications? Or a data scientist interested in expanding your skill set? Generative AI has something to offer.

Top Generative AI Courses

1. The AI Content Machine Challenge by AutoGPT

Description: This 28-day challenge is designed to transform your content creation process using generative AI. It offers daily practical challenges and guidance to help you master AI tools and techniques for efficient content creation.

Key Features:

28-day content creation challenge
Daily practical challenges and guidance
Lifetime access to materials
3 exclusive bonuses worth $67: AI Money Making Ideas, Creating SEO Article Prompts, Midjourney Prompts

Duration: 28 days

Price: $97

Link: autogpt.net/sp/the-ai-content-machine

2. DeepLearning.AI’s Generative Adversarial Networks (GANs) Specialization

Description: This course, offered by Coursera, covers the fundamentals of GANs, a popular type of generative AI. It includes modules on how GANs work, how to implement them, and their applications.

Key Features:

Introduction to GANs
Building and training GANs
Applications of GANs in various fields
Hands-on projects

Duration: 2 months at 10 hours a week

Price: $39/month

Link: www.coursera.org/specializations/generative-adversarial-networks-gans

3. Stanford University’s CS231n: Convolutional Neural Networks for Visual Recognition

Description: This course covers deep learning for computer vision, including generative models for image generation. It’s a highly respected course in the field of AI.

Key Features:

Convolutional neural networks (CNNs)
Generative models for image recognition and generation
Practical implementation and case studies
Lectures from leading AI researchers

Duration: 10 weeks

Price: Free (course materials available online)

Link: cs231n.stanford.edu

4. Google Cloud’s Generative AI Learning Path

Description: Offered by Google Cloud, this learning path includes multiple courses focusing on different aspects of generative AI. It’s designed for professionals looking to leverage Google’s AI tools.

Key Features:

Introduction to generative AI and its applications
Using Google Cloud’s AI tools for generative tasks
Practical labs and projects
Certification upon completion

Duration: Self-paced

Price: Free to access; cost for certification

Link: cloudskillsboost.google/paths/118

5. edX’s Generative AI for Everyone

Description: This course offers a decent curriculum covering a few generative AI tools and platforms. However, it relies on recorded videos from instructors, leaving you to learn on your own with no guidance. It lacks mentorship and doesn’t offer any projects to work on, resulting in a lack of practical exposure.

Key Features:

Overview of generative AI tools and platforms
Self-paced learning through recorded videos
Lack of mentorship and practical projects

Duration: 4 months (1 – 3 hours per week)

Price: $220.50

Link: edx.org/certificates/professional-certificate/ibm-generative-ai-for-everyone

6. 1stepGrow’s Advanced Data Science and Artificial Intelligence Course

Description: This is a well-structured course covering data science and AI comprehensively. It includes content on generative AI with various tools and platforms, along with project exposure. Live classes are conducted in real-time with industrial experts. They keep their batches small, allowing students to get personalized mentorship and guidance.

Key Features:

Comprehensive coverage of data science and AI
Real-time live classes with industrial experts
Small batch sizes for personalized mentorship
Domain specialization and industry project exposure
Job assistance and career support

Duration: 480+ Hours

Price: ₹7146/ month ($85)

Link: 1stepgrow.com/advanced-data-science-and-artificial-intelligence-course

7. Simplilearn: Applied Generative AI Specialization by Purdue University

Description: This is an online boot camp course conducted through a blended mode with a mix of live classes and recorded videos. They provide very limited projects to work on and also lack domain specialization, hindering expertise in a particular field. Moreover, they are quite expensive for the limited features they offer.

Key Features:

Blended mode of learning with live and recorded classes
Limited project work and domain specialization
Higher cost compared to features offered

Duration: 16 weeks

Price: $250/month

Link: simplilearn.com/applied-ai-course

8. upGrad: Advanced Certificate Program in Generative AI

Description: A well-structured course taught through a blended mode of learning, mostly relying on recorded videos. They provide very few projects to work on and focus more on theoretical aspects.

They conduct classes in large groups, making it challenging to address individual doubts and receive personalized mentorship. Furthermore, these courses tend to be expensive and lack specialized focus on particular domains.

Key Features:

Blended learning mode with a focus on recorded videos
Limited project work and practical exposure
Large class sizes and high cost

Duration: 4 months

Price: EUR 2,300

Link: upgrad.com/gb/advanced-certificate-program-generative-ai/

9. DataCamp Generative AI Course

Description: DataCamp’s AI program includes courses on generative AI, covering various tools and techniques. It offers interactive learning with hands-on projects and exercises.

Key Features:

Interactive learning with hands-on projects
Comprehensive coverage of generative AI tools and techniques
Self-paced learning

Duration: Self-paced

Price: Varies

Link: app.datacamp.com/learn/courses/generative-ai-concepts

10. Coursera: Generative AI with Large Language Models

Description: This course focuses on generative AI using large language models. It includes theoretical concepts, practical implementations, and real-world applications.

Key Features:

Focus on large language models
Theoretical and practical learning
Real-world applications and projects

Duration: 16hours

Price: Free

Link: coursera.org/learn/generative-ai-with-llms

11. Learnbay: Advanced AI and Machine Learning Certification Program

Description: This program offers advanced training in AI and machine learning, including generative AI models. It focuses on practical applications and industry projects.

Key Features:

Advanced training in AI and machine learning
Practical applications and industry projects
Comprehensive coverage of generative AI

Duration: Varies

Price: Varies

Link: learnbay.co/datascience/artificial-intelligence-certification-course

12. Intellipaat: Advanced Certification in Generative AI and Prompt Engineering

Description: This certification program covers generative AI and prompt engineering, focusing on practical skills and real-world applications. It includes hands-on projects and mentorship.

Key Features:

Focus on generative AI and prompt engineering
Practical skills and real-world applications
Hands-on projects and mentorship

Duration: Varies

Price: Varies

Link: intellipaat.com/generative-ai-prompt-engineering-course

13. LinkedIn Learning: Career Essentials in Generative AI by Microsoft

Description: This course, offered by LinkedIn Learning in collaboration with Microsoft, provides essential skills for a career in generative AI. It includes theoretical concepts, practical implementations, and industry insights.

Key Features:

Essential skills for a career in generative AI
Theoretical and practical learning
Industry insights from Microsoft

Duration: 5 hours

Price: Free

Link: linkedin.com/learning/paths/career-essentials-in-generative-ai-by-microsoft-and-linkedin

How to Choose the Right Generative AI Course

With so many options available, selecting the right course can be challenging. Here are some factors to consider:

1. Your Current Skill Level

Are you a beginner, intermediate, or advanced learner? Choose a course that matches your skill level to ensure you get the most out of it.

2. Course Content

Look at the syllabus and see if it covers the topics you’re interested in. Ensure it includes both theoretical knowledge and practical applications.

3. Instructor Expertise

The experience and background of the instructors can significantly impact the quality of the course. Check their credentials and previous work in the field of AI.

4. Hands-on Projects

Practical projects are crucial for understanding and applying generative AI concepts. Ensure the course includes real-world projects that you can work on.

5. Price

Consider your budget and see if the course offers value for money. Some courses might be expensive but offer extensive resources and support.

6. Duration

Make sure the course fits into your schedule. Some courses are self-paced, while others have fixed durations and deadlines.

Benefits of Learning Generative AI

1. Career Advancement

With the increasing demand for AI professionals, having expertise in generative AI can boost your career prospects and open up new job opportunities.

2. Innovative Applications

Generative AI is used in various innovative applications, from creating art and music to developing new products and services. Learning generative AI can enable you to contribute to these advancements.

3. Enhanced Creativity

Generative AI tools can enhance your creativity by providing new ways to generate ideas and content. Whether you’re an artist, writer, or designer, these tools can help you create unique and original work.

4. Problem Solving

Generative AI can help solve complex problems by generating new solutions and ideas. This can be particularly useful in fields like research, medicine, and engineering.

5. Personal Growth

Learning a new and challenging field like generative AI can be a rewarding experience. It can help you develop new skills, expand your knowledge, and stay updated with the latest technological trends.

The Bottom Line

Generative AI is a fascinating and rapidly evolving field with immense potential. Whether you’re looking to advance your career, explore new creative possibilities, or solve complex problems, learning generative AI can be incredibly rewarding.

With numerous high-quality courses available, you can find one that fits your needs and helps you achieve your goals.

FAQs

1. Which course is best for generative AI?

The best course for generative AI depends on your needs, but DeepLearning.AI’s GANs Specialization and The AI Content Machine Challenge by AutoGPT are highly recommended for comprehensive learning.

2. How can I learn generative AI?

You can learn generative AI by enrolling in online courses, practicing with hands-on projects, and studying relevant AI models and techniques.

3. Is there a generative AI certification?

Yes, several online platforms like Coursera, Udacity, and Simplilearn offer certifications in generative AI upon course completion.

4. How long is the generative AI course?

The length of a generative AI course varies, ranging from a few weeks to several months, depending on the depth and intensity of the program.

Source link

Related Topics:ai technology Artificial intelligence Generative AI

Up Next

New courses and certification updates from AWS Training and Certification in January 2025

Don't Miss

Courses, Free Resources, & Corporate Training

Onome

Click to comment

Books, Courses & Certifications

4-Year-Olds Pose Messy Challenge for Elementary Schools: Toilet-Training

Published

4 hours ago

September 5, 2025

Lasherica Thornton, EdSource

More 4-year-olds across California are entering transitional kindergarten (TK) this year — curious and eager to play and learn. But some aren’t fully potty-trained, posing an unexpected challenge for schools.

“They are younger, and they’re going to have more accidents,” said Elyse Doerflinger, a TK teacher in the Woodlake Unified School District in Tulare County. “Then what?”

It’s a question school districts across the state are grappling with as they expand TK to younger children.

Once designed to serve only children who missed the kindergarten age cutoff, transitional kindergarten, often referred to as TK, has expanded to include all 4-year-olds, including those who turn 4 on Sept. 1.

Teaching children through play is one thing, but handling potty issues is another. There’s state guidance, but with little to no local direction, toileting practices differ across the board.

Private preschools vary in their approach to toilet-training. Many programs train preschool teachers to help children with toileting while others require children to be potty-trained before enrolling. Public schools cannot require students to be toilet-trained, but elementary school teachers are often not trained to help.

Most districts have adopted a hands-off approach for staff to work with TK students who have an accident, which relies on verbal guidance to talk to a student through the bathroom door when changing out of soiled clothes. When that fails, those students must wait for their parents to come to campus and help, disrupting everyone’s learning.

“You can’t hold down a job if you’re constantly being called to the school to get your kid,” Doerflinger said.

Other districts have created procedures to support students through toileting plans or special education services. Some teachers, without district guidance, have found ways to help their students as best as they can.

But there’s not one model that will work for all districts, schools, or even classrooms, said Patricia Lozano, executive director of the advocacy group Early Edge California.

‘We Do Not Wipe’

“Can you come wipe me?” a Fresno Unified School District TK student yelled out last week from the connected bathroom in Kristi Henkle’s class.

“We don’t do that,” she responded.

“Who’s going to do it?” the student quickly replied. It was the third day of school.

Educators in TK frequently remind young learners to use the restroom, wash their hands and flush the toilet, among other bathroom etiquette, like putting toilet paper, not paper towels, into the toilet.

But wiping students, many educators say, is well beyond those tasks.

“We do not wipe,” said Shawna Adam, a TK teacher in Hacienda La Puente Unified School District in Los Angeles County. “Our aides are not trained; neither are we as teachers for doing potty training. We’re not going to be trained in doing toileting and wiping. I am a general ed teacher.”

Although aides or paraprofessionals work alongside teachers to share responsibilities for serving young students, who will assist with wiping depends on the school district, its toileting practices for TK and labor contract language.

In many cases, a certain paraeducator is paid more to assist with wiping or changing.

“A lot of kids are not fully potty-trained,” Oakland Unified School District TK teacher Amairani Sanchez said. She has 24 students and two aides this year because the student-staff ratio for TK went down to 10-to-1. “Now that I have that second aide, if a kid needs help wiping, my para does that.”

According to the California Department of Education’s 32-page toileting toolkit, districts and schools should engage with union representatives about “which jobs will include direct toileting support activities, such as assisting a child with changing clothing or cleaning themselves.”

While contract language often dictates duties, if that isn’t specified, local district guidance or policy is necessary for educators to rely on — if it exists.

The Madera Unified School District’s two-page toileting guidance instructs its educators to verbally guide children through the process of changing into clean clothes if they wet themselves, or to discreetly send them to the office in the event of a bowel movement accident. That is the go-to strategy in other districts, such as Hacienda La Puente Unified.

“If there are accidents, (your) child (must) be able to take off their soiled clothing by themselves and we’ll give them wipes. If there’s No. 2, then, that’s on you; we call you to come down and change them,” Adam bluntly tells TK parents.

Despite how much teachers may want to help a student in soiled clothes, most are wary of disciplinary action from their district or lawsuits from the student’s family.

Not enough districts have clear guidance with systems and supports in place for teachers to overcome the challenges created by the expansion of TK.

“Teachers across the state, in their unions, are fighting and advocating for more resources for our youngest students,” said David Goldberg, president of the California Teachers Association.

Proper Facilities Necessary

Without contract language or local guidance, educators sometimes reach a consensus on their own.

Special education paraprofessionals, who are often tasked with supporting children who are still in diapers, have helped general education teachers and aides create toileting resources for the classroom.

The special education teacher will “put men’s boxers over her own clothes to demonstrate how to pull your pants down,” Doerflinger said about the teacher at her school. “She’ll take paper plates and smear peanut butter all over and have the kids practice wiping it off. They learn you have to do it until it’s clean all the way.”

Even for districts that have adopted approaches, such as opt-in forms on whether students can be helped with toileting, educators need more than just guidance to handle it.

Teachers and aides require supplies, such as wipes and gloves, as well as a trash can for the proper disposal of materials.

Schools also need the right kinds of facilities.

In some schools, TK classrooms do not have their own bathrooms, and the young children must use the same bathroom as all other students. In such cases, educators are unable to guide a child through the process of cleaning and dressing themselves.

In one elementary bathroom, a naked TK student runs to the door each day, carrying her clothes because she doesn’t know how to put them back on.

That’s one of many reasons why having a bathroom in the TK classroom is ideal for 4-year-olds.

A lack of state funding impacted districts’ ability to add toilets and bathrooms for them.

California voters in November approved $40 billion in local construction bonds and $10 billion in a statewide bond for facilities, but none of those funds are exclusively for transitional kindergarten. Because districts are also struggling to meet facilities needs, such as outdated or deteriorating buildings, TK will likely not take priority.

Many school districts still report that facilities, including developing age-appropriate bathrooms connected to classrooms, are a top challenge in implementing universal prekindergarten, according to a June 2025 Learning Policy Institute report.

At Greenberg Elementary School in Fresno Unified, for example, one TK classroom has a bathroom, while the other doesn’t. The other classroom on campus with a bathroom is for kindergarten students, who also require smaller toilets.

Students in classes with attached bathrooms have the freedom to go on their own schedules. Those who do not have an in-class restroom, however, must follow a stricter schedule with all of them going to the bathroom at once or must trek to the elementary bathroom with an aide.

“The intention of having a second adult in the classroom is for them to be a second teacher,” not “the walker to the bathroom,” said Hanna Melnick, director of early learning policy at Learning Policy Institute. “It defeats the purpose.”

Two dozen students being on the same potty schedule, teachers say, can also lead to more mishaps, so teachers must quickly get students in and out to avoid a potty accident.

A Partnership With Families

Before this school year, there was a premise that 4-year-olds who were not potty-trained needed an individualized education plan (IEP) for special education services, which required an aide for changing diapers and helping with toileting.

But that shouldn’t be the norm, some teachers and experts say.

“If you actually think there could be a disability, then let’s assess and check,” said Doerflinger, the TK teacher in Woodlake Unified who has one student who is not potty-trained and has been identified as having special needs and another student in the process of being recognized for such services. “Some kids just have trauma. Some kids just take longer. Some kids are terrified of a bathroom with a loud flushing toilet.”

While there will need to be a mindset shift among educators who believe all students who are not potty-trained need special education services, teachers and experts agree that families should play an active role in potty training their 4-year-olds.

In Los Angeles Unified, the state’s largest school district, toileting is considered a team effort between families and schools, said Pia Sadaqatmal, the district’s chief of transitional programs.

For students to have “toileting independence,” they should have opportunities to practice their toileting skills and exercise that independence, she said, noting frequent bathroom reminders and breaks at school, step-by-step picture guides once in the restroom, and books, instructional materials and resources shared with families to support toileting at home.

By sharing resources with families, “we’re all using the same language from the time the child wakes up in the morning until the time the child goes to bed at night,” said Ranae Amezquita, the district’s early childhood education director.

“We say, ‘Are kids ready for school?’ Also, ‘Are our schools ready for kids?’” said Lozano with Early Edge. “That is something that schools need to think about.”

Source link

Books, Courses & Certifications

Most Candidates Fail These SQL Concepts in Data Interviews

Published

6 hours ago

September 5, 2025

Nate Rosidi

Image by author | Canva

An interviewer’s job is to find the most suitable candidates for the advertised position. In doing so, they will gladly set up SQL interview questions to see if they can catch you off guard. There are several SQL concepts at which candidates often fail.

Hopefully, you’ll be one of those who avoid that destiny, as I’ll explain these concepts in detail below, complete with examples of how to solve certain problems correctly.

# 1. Window Functions

Why It’s Hard: Candidates memorize what each window function does but don’t really understand how window frames, partitions, or ordering actually work.

Common Mistakes: A common mistake is not specifying ORDER BY in ranking window functions or value window functions, such as LEAD() or LAG(), and expecting the query to work or for the result to be deterministic.

Example: In this example, you need to find users who made a second purchase within 7 days of any previous purchase.

You might write this query.

WITH ordered_tx AS (
  SELECT user_id,
         created_at::date AS tx_date,
         LAG(created_at::DATE) OVER (PARTITION BY user_id) AS prev_tx_date
  FROM amazon_transactions
)

SELECT DISTINCT user_id
FROM ordered_tx
WHERE prev_tx_date IS NOT NULL AND tx_date - prev_tx_date <= 7;

At first glance, everything might seem right. The code even outputs something that might appear to be a correct answer.

First of all, we are lucky that the code works at all! This happens simply because I’m writing it in PostgreSQL. In some other SQL flavors, you’d get an error since ORDER BY is mandatory in ranking and analytical window functions.

Second, the output is wrong; I highlighted some rows that shouldn’t be there. Why do they appear, then?

They appear because we didn’t specify an ORDER BY clause in the LAG() window function. Without it, the row order is arbitrary. So, we are comparing the current transaction to some random earlier row for that user, not the one that occurred immediately before it in time.

This is not what the question asks. We need to compare each transaction to the previous one by date. In other words, we need to specify this explicitly in the ORDER BY clause within the LAG() function.

WITH ordered_tx AS (
  SELECT user_id,
         created_at::date AS tx_date,
         LAG(created_at::DATE) OVER (PARTITION BY user_id ORDER BY created_at) AS prev_tx_date
  FROM amazon_transactions
)

SELECT DISTINCT user_id
FROM ordered_tx
WHERE prev_tx_date IS NOT NULL AND tx_date - prev_tx_date <= 7;

# 2. Filtering With Aggregates (Especially HAVING vs. WHERE)

Why It’s Hard: People often don’t understand the execution order in SQL, which is: FROM -> WHERE -> GROUP BY -> HAVING -> SELECT -> ORDER BY. This order means that WHERE filters rows before aggregation, and HAVING filters after. That also, logically, means that you can’t use aggregate functions in the WHERE clause.

Common Mistake: Trying to use aggregate functions in WHERE in a grouped query and getting an error.

Example: This interview question asks you to find the total revenue made by each winery. Only wineries where 90 is the lowest number of points for any of their varieties should be considered.

Many will see this as an easy question and hastily write this query.

SELECT winery,
       variety,
       SUM(price) AS total_revenue
FROM winemag_p1
WHERE MIN(points) >= 90
GROUP BY winery, variety
ORDER BY winery, total_revenue DESC;

However, that code will throw an error stating that aggregate functions are not allowed in the WHERE clause. This pretty much explains everything. The solution? Move the filtering condition from WHERE to HAVING.

SELECT winery,
       variety,
       SUM(price) AS total_revenue
FROM winemag_p1
GROUP BY winery, variety
HAVING MIN(points) >= 90
ORDER BY winery, total_revenue DESC;

# 3. Self-Joins for Time-Based or Event-Based Comparisons

Why It’s Hard: The idea of joining a table with itself is quite unintuitive, so candidates often forget it’s an option.

Common Mistake: Using subqueries and complicating the query when joining a table with itself would be simpler and faster, especially when filtering by dates or events.

Example: Here’s a question asking you to show the change of every currency’s exchange rate between 1 January 2020 and 1 July 2020.

You can solve this by writing an outer correlated subquery that fetches the July 1 exchange rates, then subtracts the January 1 exchange rates, which come from the inner subquery.

SELECT jan_rates.source_currency,
  (SELECT exchange_rate 
   FROM sf_exchange_rate 
   WHERE source_currency = jan_rates.source_currency AND date="2020-07-01") - jan_rates.exchange_rate AS difference
FROM (SELECT source_currency, exchange_rate
      FROM sf_exchange_rate
      WHERE date="2020-01-01"
) AS jan_rates;

This returns a correct output, but such a solution is unnecessarily complicated. A much simpler solution, with fewer lines of code, involves self-joining a table with itself and then applying two date filtering conditions in the WHERE clause.

SELECT jan.source_currency,
       jul.exchange_rate - jan.exchange_rate AS difference
FROM sf_exchange_rate jan
JOIN sf_exchange_rate jul ON jan.source_currency = jul.source_currency
WHERE jan.date="2020-01-01" AND jul.date="2020-07-01";

# 4. Subqueries vs. Common Table Expressions (CTEs)

Why It’s Hard: People often get stuck on subqueries because they learn them before Common Table Expressions (CTEs) and continue using them for any query with layered logic. However, subqueries can get messy very quickly.

Common Mistake: Using deeply nested SELECT statements when CTEs would be much simpler.

Example: In the interview question from Google and Netflix, you need to find the top actors based on their average movie rating within the genre in which they appear most frequently.

The solution using CTEs is as follows.

WITH genre_stats AS
  (SELECT actor_name,
          genre,
          COUNT(*) AS movie_count,
          AVG(movie_rating) AS avg_rating
   FROM top_actors_rating
   GROUP BY actor_name,
            genre),
            
max_genre_count AS
  (SELECT actor_name,
          MAX(movie_count) AS max_count
   FROM genre_stats
   GROUP BY actor_name),
     
top_genres AS
  (SELECT gs.*
   FROM genre_stats gs
   JOIN max_genre_count mgc ON gs.actor_name = mgc.actor_name
   AND gs.movie_count = mgc.max_count),
     
top_genre_avg AS
  (SELECT actor_name,
          MAX(avg_rating) AS max_avg_rating
   FROM top_genres
   GROUP BY actor_name),
   
filtered_top_genres AS
  (SELECT tg.*
   FROM top_genres tg
   JOIN top_genre_avg tga ON tg.actor_name = tga.actor_name
   AND tg.avg_rating = tga.max_avg_rating),
     ranked_actors AS
  (SELECT *,
          DENSE_RANK() OVER (
                             ORDER BY avg_rating DESC) AS rank
   FROM filtered_top_genres),
   
final_selection AS
  (SELECT MAX(rank) AS max_rank
   FROM ranked_actors
   WHERE rank <= 3)
   
SELECT actor_name,
       genre,
       avg_rating
FROM ranked_actors
WHERE rank <=
    (SELECT max_rank
     FROM final_selection);

It is relatively complicated, but it still consists of six clear CTEs, with the code’s readability enhanced by clear aliases.

Curious what the same solution would look like using only subqueries? Here it is.

SELECT ra.actor_name,
       ra.genre,
       ra.avg_rating
FROM (
    SELECT *,
           DENSE_RANK() OVER (ORDER BY avg_rating DESC) AS rank
    FROM (
        SELECT tg.*
        FROM (
            SELECT gs.*
            FROM (
                SELECT actor_name,
                       genre,
                       COUNT(*) AS movie_count,
                       AVG(movie_rating) AS avg_rating
                FROM top_actors_rating
                GROUP BY actor_name, genre
            ) AS gs
            JOIN (
                SELECT actor_name,
                       MAX(movie_count) AS max_count
                FROM (
                    SELECT actor_name,
                           genre,
                           COUNT(*) AS movie_count,
                           AVG(movie_rating) AS avg_rating
                    FROM top_actors_rating
                    GROUP BY actor_name, genre
                ) AS genre_stats
                GROUP BY actor_name
            ) AS mgc
            ON gs.actor_name = mgc.actor_name AND gs.movie_count = mgc.max_count
        ) AS tg
        JOIN (
            SELECT actor_name,
                   MAX(avg_rating) AS max_avg_rating
            FROM (
                SELECT gs.*
                FROM (
                    SELECT actor_name,
                           genre,
                           COUNT(*) AS movie_count,
                           AVG(movie_rating) AS avg_rating
                    FROM top_actors_rating
                    GROUP BY actor_name, genre
                ) AS gs
                JOIN (
                    SELECT actor_name,
                           MAX(movie_count) AS max_count
                    FROM (
                        SELECT actor_name,
                               genre,
                               COUNT(*) AS movie_count,
                               AVG(movie_rating) AS avg_rating
                        FROM top_actors_rating
                        GROUP BY actor_name, genre
                    ) AS genre_stats
                    GROUP BY actor_name
                ) AS mgc
                ON gs.actor_name = mgc.actor_name AND gs.movie_count = mgc.max_count
            ) AS top_genres
            GROUP BY actor_name
        ) AS tga
        ON tg.actor_name = tga.actor_name AND tg.avg_rating = tga.max_avg_rating
    ) AS filtered_top_genres
) AS ra
WHERE ra.rank <= (
    SELECT MAX(rank)
    FROM (
        SELECT *,
               DENSE_RANK() OVER (ORDER BY avg_rating DESC) AS rank
        FROM (
            SELECT tg.*
            FROM (
                SELECT gs.*
                FROM (
                    SELECT actor_name,
                           genre,
                           COUNT(*) AS movie_count,
                           AVG(movie_rating) AS avg_rating
                    FROM top_actors_rating
                    GROUP BY actor_name, genre
                ) AS gs
                JOIN (
                    SELECT actor_name,
                           MAX(movie_count) AS max_count
                    FROM (
                        SELECT actor_name,
                               genre,
                               COUNT(*) AS movie_count,
                               AVG(movie_rating) AS avg_rating
                        FROM top_actors_rating
                        GROUP BY actor_name, genre
                    ) AS genre_stats
                    GROUP BY actor_name
                ) AS mgc
                ON gs.actor_name = mgc.actor_name AND gs.movie_count = mgc.max_count
            ) AS tg
            JOIN (
                SELECT actor_name,
                       MAX(avg_rating) AS max_avg_rating
                FROM (
                    SELECT gs.*
                    FROM (
                        SELECT actor_name,
                               genre,
                               COUNT(*) AS movie_count,
                               AVG(movie_rating) AS avg_rating
                        FROM top_actors_rating
                        GROUP BY actor_name, genre
                    ) AS gs
                    JOIN (
                        SELECT actor_name,
                               MAX(movie_count) AS max_count
                        FROM (
                            SELECT actor_name,
                                   genre,
                                   COUNT(*) AS movie_count,
                                   AVG(movie_rating) AS avg_rating
                            FROM top_actors_rating
                            GROUP BY actor_name, genre
                        ) AS genre_stats
                        GROUP BY actor_name
                    ) AS mgc
                    ON gs.actor_name = mgc.actor_name AND gs.movie_count = mgc.max_count
                ) AS top_genres
                GROUP BY actor_name
            ) AS tga
            ON tg.actor_name = tga.actor_name AND tg.avg_rating = tga.max_avg_rating
        ) AS filtered_top_genres
    ) AS ranked_actors
    WHERE rank <= 3
);

There is redundant logic repeated across subqueries. How many subqueries is that? I have no idea. The code is impossible to maintain. Even though I just wrote it, I’d still need half a day to understand it if I wanted to change something tomorrow. Additionally, the completely meaningless subquery aliases do not help.

# 5. Handling NULLs in Logic

Why It’s Hard: Candidates often think that NULL is equal to something. It’s not. NULL isn’t equal to anything — not even itself. Logic involving NULLs behaves differently from logic involving actual values.

Common Mistake: Using = NULL instead of IS NULL in filtering or missing output rows because NULLs break the condition logic.

Example: There’s an interview question by IBM that asks you to calculate the total number of interactions and the total number of contents created for each customer.

It doesn’t sound too tricky, so you might write this solution with two CTEs, where one CTE counts the number of interactions per customer, while the other counts the number of content items created by a customer. In the final SELECT, you FULL OUTER JOIN the two CTEs, and you have the solution. Right?

WITH interactions_summary AS
  (SELECT customer_id,
          COUNT(*) AS total_interactions
   FROM customer_interactions
   GROUP BY customer_id),
   
content_summary AS
  (SELECT customer_id,
          COUNT(*) AS total_content_items
   FROM user_content
   GROUP BY customer_id)
   
SELECT i.customer_id,
  i.total_interactions,
  c.total_content_items
FROM interactions_summary AS i
FULL OUTER JOIN content_summary AS c ON i.customer_id = c.customer_id
ORDER BY customer_id;

Almost right. Here’s the output. (By the way, you see double quotation marks (“”) instead of NULL. That’s how the StrataScratch UI displays it, but trust me, the engine still treats them for what they are: NULL values).

The highlighted rows contain NULLs. This makes the output incorrect. A NULL value is neither the customer ID nor the number of interactions and contents, which the question explicitly asks you to show.

What we’re missing in the above solution is COALESCE() to handle NULLs in the final SELECT. Now, all the customers without interactions will get their IDs from the content_summary CTE. Also, for customers that don’t have interactions, or content, or both, we’ll now replace NULL with 0, which is a valid number.

WITH interactions_summary AS
  (SELECT customer_id,
          COUNT(*) AS total_interactions
   FROM customer_interactions
   GROUP BY customer_id),
   
content_summary AS
  (SELECT customer_id,
          COUNT(*) AS total_content_items
   FROM user_content
   GROUP BY customer_id)
   
SELECT COALESCE(i.customer_id, c.customer_id) AS customer_id,
       COALESCE(i.total_interactions, 0) AS total_interactions,
       COALESCE(c.total_content_items, 0) AS total_content_items
FROM interactions_summary AS i
FULL OUTER JOIN content_summary AS c ON i.customer_id = c.customer_id
ORDER BY customer_id;

# 6. Group-Based Deduplication

Why It’s Hard: Group-based deduplication means you’re selecting one row per group, e.g., “most recent”, “highest score”, etc. At first, it sounds like you only need to pick one row per user. But you can’t use GROUP BY unless you aggregate. On the other hand, you often need a full row, not a single value that aggregation and GROUP BY return.

Common Mistake: Using GROUP BY + LIMIT 1 (or DISTINCT ON, which is PostgreSQL-specific) instead of ROW_NUMBER() or RANK(), the latter if you want ties included.

Example: This question asks you to identify the best-selling item for each month, and there’s no need to separate months by year. The best-selling item is calculated as unitprice * quantity.

The naive approach would be this. First, extract the sale month from invoicedate, select description, and find the total sales by summing unitprice * quantity. Then, to get the total sales by month and product description, we simply GROUP BY those two columns. Finally, we only need to use ORDER BY to sort the output from the best to the worst-selling product and use LIMIT 1 to output only the first row, i.e., the best-selling item.

SELECT DATE_PART('MONTH', invoicedate) AS sale_month,
       description,
       SUM(unitprice * quantity) AS total_paid
FROM online_retail
GROUP BY sale_month, description
ORDER BY total_paid DESC
LIMIT 1;

As I said, this is naive; the output somewhat resembles what we need, but we need this for every month, not just one.

One of the correct approaches is to use the RANK() window function. With this approach, we follow a similar method to the previous code. The difference is that the query now becomes a subquery in the FROM clause. In addition, we use RANK() to partition the data by month and then rank the rows within each partition (i.e., for each month separately) from the best-selling to the worst-selling item.

Then, in the main query, we simply select the required columns and output only rows where the rank is 1 using the WHERE clause.

SELECT month,
       description,
       total_paid
FROM
  (SELECT DATE_PART('month', invoicedate) AS month,
          description,
          SUM(unitprice * quantity) AS total_paid,
          RANK() OVER (PARTITION BY DATE_PART('month', invoicedate) ORDER BY SUM(unitprice * quantity) DESC) AS rnk
   FROM online_retail
   GROUP BY month, description) AS tmp
WHERE rnk = 1;

# Conclusion

The six concepts we’ve covered commonly appear in SQL coding interview questions. Pay attention to them, practice interview questions that involve these concepts, learn the correct approaches, and you’ll significantly improve your chances in your interviews.

Nate Rosidi is a data scientist and in product strategy. He’s also an adjunct professor teaching analytics, and is the founder of StrataScratch, a platform helping data scientists prepare for their interviews with real interview questions from top companies. Nate writes on the latest trends in the career market, gives interview advice, shares data science projects, and covers everything SQL.

Source link

Books, Courses & Certifications

Top Data Science Courses 2025

Published

8 hours ago

September 5, 2025

Matthew Mayo

Image by Editor | ChatGPT

# Introduction

There are a lot of data science courses out there. Class Central alone lists over 20,000 of them. That’s crazy! I remember looking for data science courses in 2013 and having a very difficult time coming across any. There was Andrew Ng’s machine learning course, Bill Howe’s Introduction to Data Science course on Coursera, the Johns Hopkins Coursera specialization… and that’s about it IIRC.

But don’t worry; now there are more than 20,000. I know what you’re thinking: with 20,000 or more courses out there, it should be really easy to find the best, high quality ones, right? 🙄 While that isn’t the case, there are a lot of quality offerings out there, and a lot of diverse offerings as well. Gone are the days of monolith “data science” courses; today you can find very specific training on performing specific operations on particular cloud manufaturer platforms, using ChatGPT to improve your analytics workflow, and generative AI for poets (OK, not sure about that last one…). There are also options for everything from one hour targeted courses to months long specializations with multiple constituent courses on broad topics. Looking to train for free? There are lots of options. So, too, are there for those looking to pay something to have their progress recognized with a credential of some sort.

# Top Data Science Courses of 2025

Let’s not waste anymore time. Here are a collection of 10 courses (or, in a few cases, collections of courses) that are diverse in terms of topics, lengths, time commitments, credentials, vendor neutrality vs. specificity, and costs. I have tried to mix topics, and cover the basis of contemporary cutting-edge techniques that data scientists are looking to add to their repertoire. If you’re looking for data science courses, there’s bound to be something in here that appeals to you.

// 1. Retrieval Augmented Generation (RAG) Course

Platform: Coursera
Organizer: DeepLearning.AI
Credential: Coursera course certificate

Teaches how to build end-to-end RAG systems by linking large language models to external data: students learn to design retrievers, vector databases, and LLM prompts tailored to real-world needs
Covers core RAG components and trade-offs: learn different retrieval methods (semantic search, BM25, Reciprocal Rank Fusion, etc.) and how to balance cost, speed, and quality for each part of the pipeline
Hands-on, project-driven learning: assignments guide you to “build your first RAG system by writing retrieval and prompt functions”, compare retrieval techniques, scale with Weaviate (vector DB), and construct a domain-specific chatbot on real data
Realistic scenario exercises: implement a chatbot that answers FAQs from a custom dataset, handling challenges like dynamic pricing and logging for reliability

Differentiator: Deep practical focus on every piece of a RAG pipeline, which is perfect for learners who want step-by-step experience building, optimizing, and evaluating RAG systems with production tools.

// 2. IBM RAG & Agentic AI Professional Certificate

Platform: Coursera
Organizer: IBM
Credential: Coursera Professional Certificate

Focuses on cutting-edge generative AI engineering: covers prompt engineering, agentic AI (multi-agent systems), and multimodal (text, image, audio) integration for context-aware applications
Teaches RAG pipelines: building efficient RAG systems that connect LLMs to external data sources (text, image, audio), using tools like LangChain and LangGraph
Emphasizes practical AI tool integration: hands-on labs with LangChain, CrewAI, BeeAI, etc., and building full-stack GenAI applications (Python using Flask/Gradio) powered by LLMs
Develops autonomous AI agents: covers designing and orchestrating complex AI agent workflows and integrations to solve real-world tasks

Differentiator: Unique emphasis on agentic AI and integration of the latest AI frameworks (LangChain, LangGraph, CrewAI, etc.), making it ideal for developers wanting to master the newest generative AI innovations.

// 3. ChatGPT Advanced Data Analysis

Platform: Coursera
Organizer: Vanderbilt University
Credential: Coursera course certificate

Learn to leverage ChatGPT’s Advanced Data Analysis: automate a variety of data and productivity tasks, including converting Excel data into charts and slides, extracting insights from PDFs, and generating presentations from documents
Hands-on use-cases: turning an Excel file into visualizations and a PowerPoint presentation, or building a chatbot that answers questions about PDF content, using natural language prompting
Emphasizes prompt engineering for ADA: teaches how to write effective prompts to get the best results from ChatGPT’s Advanced Data Analysis tool, empowering you to efficiently direct it
No coding experience required: designed for beginners; learners practice “conversing with ChatGPT ADA” to solve problems, making it accessible for non-technical users seeking to boost productivity

Differentiator: A unique, beginner-friendly focus on automating everyday analytics and content tasks using ChatGPT’s Advanced Data Analysis, ideal for those looking to harness generative AI capabilities without writing code.

// 4. Google Advanced Data Analytics Professional Certificate

Platform: Coursera
Organizer: Google
Credential: Coursera Professional Certificate + Credly badge (ACE credit-recommended)

Comprehensive 8-course series on advanced analytics: covers statistical analysis, regression, machine learning, predictive modeling, and experimental design for handling large datasets
Emphasizes data visualization and storytelling: students learn to create impactful visualizations and apply statistical methods to investigate data, then communicate insights clearly to stakeholders
Project-based, hands-on learning: includes lab work with Jupyter Notebook, Python, and Tableau, and culminates in a capstone project, with learners building portfolio pieces to demonstrate real-world analytics skills
Built for career advancement: designed for people who already have foundational analytics knowledge and want to step up to data science roles, preparing learners for roles like senior data analyst or junior data scientist

Differentiator: Google-created curriculum that bridges basic data skills to advanced analytics, with strong emphasis on modern ML and predictive techniques, making it stand out for those aiming for higher-level data roles.

// 5. IBM Data Engineering Professional Certificate

Platform: Coursera
Organizer: IBM
Credential: Coursera Professional Certificate + IBM Digital Badge

16-course program covering core data engineering skills: Python programming, SQL and relational databases (MySQL, PostgreSQL, IBM Db2), data warehousing, and ETL concepts
Extensive toolset coverage: students gain working knowledge of NoSQL and big data technologies (MongoDB, Cassandra, Hadoop) and the Apache Spark ecosystem (Spark SQL, Spark MLlib, Spark Streaming) for large-scale data processing
Focus on data pipelines and ETL: teaches how to extract, transform, and load data using Python and Bash scripting, how to build and orchestrate pipelines with tools like Apache Airflow and Kafka, and relational DB administration and BI dashboards construction
Project-driven curriculum: practical labs and projects include designing relational databases, querying real datasets with SQL, creating an Airflow+Kafka ETL pipeline, implementing a Spark ML model, and deploying a multi-database data platform

Differentiator: Broad, entry-level-friendly data engineering track (no prior coding required) from IBM, giving a job-ready foundation, while also introducing how generative AI tools can be used in data engineering workflows.

// 6. Data Analysis with Python

Platform: freeCodeCamp
Credential: Free certification

Free, self-paced certification on Python for data analysis: fundamentals such as reading data from sources (CSV files, SQL databases, HTML) and using core libraries like NumPy, Pandas, Matplotlib, and Seaborn for processing and visualization
Covers data manipulation and cleaning: introduces key techniques for handling data (cleaning duplicates, filtering) and performing basic analytics with Python tools, with learners practicing how to use Pandas for transforming data and Matplotlib/Seaborn for charting results
Extensive hands-on exercises: includes many coding challenges and real-world projects embedded in Jupyter-style lessons, with projects such as “Page View Time Series Visualizer” and “Sea Level Predictor”
Intermediate-level, in-depth curriculum: approximately 300 hours of content covering everything from basic Python through advanced data projects, designed for dedicated self-learners seeking a solid foundation in open-source data tools

Differentiator: Completely free and project-focused, with an emphasis on fundamental Python data libraries, and ideal for learners on a budget who want a thorough grounding in open-source data analysis tools without any enrollment fees.

// 7. Kaggle Learn Micro-Courses

Platform: Kaggle
Credential: Free certificates of completion

Free, interactive micro-courses on the Kaggle platform covering a wide range of practical data topics (Python, Pandas, data visualization, SQL, machine learning, computer vision, etc.), with each course taking ~3–5 hours
Highly practical and hands-on: each lesson is a notebook-style tutorial or short coding challenge; Pandas course emphasizes solving “short hands-on challenges to perfect your data manipulation skills”, data cleaning course focuses on real-world messy data
Self-paced and bite-sized: designed to be fun and fast, as the content is concise with instant feedback
Integrated with Kaggle’s community: learners can easily switch to Kaggle’s free notebook environment to practice on real datasets and even enter competitions

Differentiator: Offers a game-like, learning-by-doing approach on Kaggle’s own platform, and it one of the quickest ways to acquire practical data skills through short, challenge-driven modules and immediate coding feedback.

// 8. Lakehouse Fundamentals

Platform: Databricks Academy
Credential: Free digital badge

Short, introductory self-paced course (~1 hour of video) on the Databricks Data Intelligence Platform
Covers Databricks basics: explains the lakehouse architecture and key products, and shows how Databricks brings together data engineering, warehousing, data science, and AI in one platform
No prerequisites: designed for absolute beginners with no prior Databricks or data platform experience

Differentiator: Fast, vendor-provided overview of Databricks’ lakehouse vision, and the quickest way to understand what Databricks offers for data and AI projects directly from the source.

// 9. Hands-On Snowflakes Essentials

Platform: Snowflake University
Credential: Free digital badges

Collection of free, hands-on Snowflake workshops: for beginners, topics range from Data Warehousing and Data Lake fundamentals to advanced use-cases in Data Engineering and Data Science
Very interactive learning: each workshop features short instructional videos plus practical labs, and you must submit lab work on the Snowflake platform, which is auto-graded
Earnable badges: successful completion of each workshop grants you a digital badge (many are free) that you can share on LinkedIn
Structured track: Snowflake recommends a learning path (starting with Data Warehousing and progressing through Collaboration, Data Lakes, etc.), ensuring a logical progression from basics to more specialized topics

Differentiator: Gamified, lab-centric training path with real-time assessment, standing out for its required hands-on lab submissions and shareable badges, making it ideal for learners who want concrete proof of Snowflake expertise.

// 10. AWS Skill Builder Generative AI Courses

Platform: AWS Skill Builder
Credentials: Digital badge (for select plans/assessments)

Comprehensive set of generative AI courses and labs: aimed at various roles, the offerings span from fundamental overviews to hands-on technical training on AWS AI services
Covers generative AI topics on AWS: e.g. foundational courses for executives, learning plans for developers and ML practitioners, and deep dives into AWS tools like Amazon Bedrock (foundational model service), LangChain integrations, and Amazon Q (an AI-powered assistant)
Role-based learning paths: includes titles like “Generative AI for Executives”, “Generative AI Learning Plan for Developers”, “Building Generative AI Applications Using Amazon Bedrock”, and more, each tailored to prepare learners for building or using gen-AI solutions on AWS
Hands-on practice: many AWS gen-AI courses come with labs to try out services (e.g. building a generative search with Q, deploying LLMs on SageMaker, or using bedrock APIs), with earned skills directly tied to AWS’s AI/ML ecosystem

Differentiator: Deep AWS integration, as these courses teach you how to leverage AWS’ latest generative AI tools and platforms, making them best suited for learners already in the AWS ecosystem who want to build production-ready gen-AI applications on AWS.

Matthew Mayo (@mattmayo13) holds a master’s degree in computer science and a graduate diploma in data mining. As managing editor of KDnuggets & Statology, and contributing editor at Machine Learning Mastery, Matthew aims to make complex data science concepts accessible. His professional interests include natural language processing, language models, machine learning algorithms, and exploring emerging AI. He is driven by a mission to democratize knowledge in the data science community. Matthew has been coding since he was 6 years old.

Source link