Part 2: Preparing Your Weather Dataset and Uploading to Google Cloud Storage – RainPredict-AI Project

๐ŸŒค️ Part 2: Preparing Your Weather Dataset & Uploading to Google Cloud Storage

From Raw Data to Cloud-Ready CSV – Powered by Siraat AI Academy


Welcome back to the RainPredict-AI series at Siraat AI Academy! In Part 2, we’ll walk you through the most critical foundation of any ML project — data preparation. Today, you’ll learn how to create a weather dataset, format it for AutoML, and upload it to Google Cloud Storage for training. ☁️๐Ÿ“ˆ


๐ŸŽฏ Why This Step Is Critical

  • AutoML models rely on high-quality, labeled datasets
  • Cloud tools need structured input to run scalable ML pipelines
  • This is where prediction accuracy starts — clean data = smart model!

๐Ÿงฐ Step-by-Step: Prepare & Upload the Dataset

  1. ๐Ÿ“„ Create a CSV file with the following columns:
    • temperature
    • humidity
    • pressure
    • rainfall (this is our label column!)
  2. ๐Ÿงช Fill in sample rows with real or synthetic data like:
    temperature,humidity,pressure,rainfall
    78,60,1015,0.4
    85,55,1010,0.0
    72,70,1008,1.2
          
  3. ๐Ÿ’พ Save the file as weather_data.csv
  4. ☁️ Upload to Google Cloud Storage:
    • Go to your Google Cloud Console
    • Enable the Cloud Storage API (if not done yet)
    • Create a bucket (e.g., rainpredict-ai-data)
    • Upload your weather_data.csv file to the bucket
  5. ๐Ÿ”— Copy the public URL (or GCS path) of the CSV — you’ll need it in Part 3 to feed into AutoML!

⚡ Pro Tips for Cloud Dataset Prep

  • Use consistent units (e.g., Fahrenheit or Celsius — don’t mix)
  • No missing values — clean any blanks before upload
  • Keep column names lowercase and avoid spaces
  • Use 100–1000 rows to start (to stay within free limits)

✅ What You’ve Accomplished So Far:

  • Created a labeled weather dataset with features + label
  • Saved it in a CSV format — ready for AutoML
  • Uploaded the file securely to Google Cloud Storage
  • You're now prepared to feed real data into your AI model!

๐ŸŒŸ Final Insights

  • Structured data is the foundation of AI accuracy
  • AutoML loves CSVs — format matters more than you think!
  • You’ve completed the most essential “data engineering” step
  • Uploading to Cloud = production-grade workflow!

๐Ÿ’ก Remember: Garbage in = garbage out. Start strong with clean data!


⏭️ Up Next: Part 3 – Using AutoML to Train Your Rainfall Prediction Model

Get ready to watch your dataset turn into a smart AI model — all through Google’s AutoML magic. No code, just clicks and training power!

๐Ÿ“ข Don’t miss it! Bookmark this blog and follow Siraat AI Academy for the full build series.

Comments

Popular posts from this blog

Thrown Into the Azure River by AI — An AZ-104 Learning Story

Lecture 01 – Cloud Readiness & Digital Transformation: Understanding the Real Requirements

Lecture 02 – Foundations of Digital Transformation & Cloud Concepts