Part 2: Preparing Your Weather Dataset and Uploading to Google Cloud Storage – RainPredict-AI Project
๐ค️ Part 2: Preparing Your Weather Dataset & Uploading to Google Cloud Storage
From Raw Data to Cloud-Ready CSV – Powered by Siraat AI Academy
Welcome back to the RainPredict-AI series at Siraat AI Academy! In Part 2, we’ll walk you through the most critical foundation of any ML project — data preparation. Today, you’ll learn how to create a weather dataset, format it for AutoML, and upload it to Google Cloud Storage for training. ☁️๐
๐ฏ Why This Step Is Critical
- AutoML models rely on high-quality, labeled datasets
- Cloud tools need structured input to run scalable ML pipelines
- This is where prediction accuracy starts — clean data = smart model!
๐งฐ Step-by-Step: Prepare & Upload the Dataset
- ๐ Create a CSV file with the following columns:
temperaturehumiditypressurerainfall(this is our label column!)
- ๐งช Fill in sample rows with real or synthetic data like:
temperature,humidity,pressure,rainfall 78,60,1015,0.4 85,55,1010,0.0 72,70,1008,1.2 - ๐พ Save the file as
weather_data.csv - ☁️ Upload to Google Cloud Storage:
- Go to your Google Cloud Console
- Enable the Cloud Storage API (if not done yet)
- Create a bucket (e.g.,
rainpredict-ai-data) - Upload your
weather_data.csvfile to the bucket
- ๐ Copy the public URL (or GCS path) of the CSV — you’ll need it in Part 3 to feed into AutoML!
⚡ Pro Tips for Cloud Dataset Prep
- Use consistent units (e.g., Fahrenheit or Celsius — don’t mix)
- No missing values — clean any blanks before upload
- Keep column names lowercase and avoid spaces
- Use 100–1000 rows to start (to stay within free limits)
✅ What You’ve Accomplished So Far:
- Created a labeled weather dataset with
features + label - Saved it in a CSV format — ready for AutoML
- Uploaded the file securely to Google Cloud Storage
- You're now prepared to feed real data into your AI model!
๐ Final Insights
- Structured data is the foundation of AI accuracy
- AutoML loves CSVs — format matters more than you think!
- You’ve completed the most essential “data engineering” step
- Uploading to Cloud = production-grade workflow!
๐ก Remember: Garbage in = garbage out. Start strong with clean data!
⏭️ Up Next: Part 3 – Using AutoML to Train Your Rainfall Prediction Model
Get ready to watch your dataset turn into a smart AI model — all through Google’s AutoML magic. No code, just clicks and training power!
๐ข Don’t miss it! Bookmark this blog and follow Siraat AI Academy for the full build series.
Comments
Post a Comment