
setup backend code to make satellite predictions #3613

Merged · 9 commits merged into staging on Oct 11, 2024
Conversation

Mnoble-19
Contributor

@Mnoble-19 Mnoble-19 commented Oct 10, 2024

Description

[x] Setup API endpoints to retrieve satellite predictions

Related Issues

Changes Made

  • Brief description of change 1
  • Brief description of change 2
  • Brief description of change 3

Testing

  • Tested locally
  • Tested against staging environment
  • Relevant tests passed: [List test names]

Affected Services

  • Which services were modified:
    • Service 1
    • Service 2
    • Other...

Endpoints Ready for Testing

  • New endpoints ready for testing:
    • Endpoint 1
    • Endpoint 2
    • Other...

API Documentation Updated?

  • Yes, API documentation was updated
  • No, API documentation does not need updating

Additional Notes

[Add any additional notes or comments here]

Summary by CodeRabbit

  • New Features

    • Integrated cloud storage capabilities for satellite data access.
    • Added a new route for satellite predictions, allowing users to request predictions based on geographical coordinates, including an optional city parameter.
    • Introduced a new class for managing satellite predictions and enhanced methods for data extraction and processing.
  • Bug Fixes

    • Minor formatting adjustments for improved code readability.
  • Chores

    • Updated package dependencies with version constraints for better clarity and stability.
    • Updated the Dockerfile to use a newer base image for improved performance.

Contributor

coderabbitai bot commented Oct 10, 2024

📝 Walkthrough

The pull request introduces several enhancements across multiple files within the spatial module. Key changes include the addition of cloud storage capabilities for satellite data access, the introduction of a new route for satellite predictions, and the creation of classes and methods to facilitate satellite data processing and model retrieval. Notably, a new dictionary organizes satellite collections, and methods for initializing the Earth Engine and extracting data based on geographical coordinates are implemented.

Changes

  • src/spatial/configure.py — Added PROJECT_BUCKET, the satellite_collections dictionary, and the get_trained_model_from_gcs function.
  • src/spatial/controllers/controllers.py — Introduced the '/satellite_prediction' route and the get_satellite_prediction function.
  • src/spatial/models/SatellitePredictionModel.py — Expanded the SatellitePredictionModel class with methods for data extraction and Earth Engine initialization.
  • src/spatial/views/satellite_predictions.py — Enhanced the SatellitePredictionView class; the updated make_predictions method now accepts a city parameter.
  • src/spatial/requirements.txt — Updated package versions and added new dependencies for cloud storage and data processing.
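For manual testing of the new route, the request body can be assembled as sketched below. This is a minimal illustration only: the helper name is hypothetical, while the field names (latitude and longitude required, city optional) follow the validation described in the review of satellite_predictions.py.

```python
import json

def build_prediction_payload(latitude, longitude, city=None):
    """Build the JSON body for a POST to /satellite_prediction.

    Mirrors the view's validation: latitude and longitude are required,
    city is optional. Raises ValueError when a required field is missing.
    """
    if latitude is None or longitude is None:
        raise ValueError("latitude and longitude are required")
    payload = {"latitude": latitude, "longitude": longitude}
    if city is not None:
        payload["city"] = city
    return json.dumps(payload)
```

The resulting string can then be sent as the request body with Content-Type application/json by any HTTP client.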

Assessment against linked issues

  • Introduce a GitHub template for PRs (#123) — not addressed: no changes related to GitHub templates were made.
  • Calculate exceedances (#456) — not addressed: the changes do not explicitly address exceedance calculations.

🌟 In the realm of data and skies so wide,
New features emerge, like stars they glide.
From clouds we fetch models, predictions take flight,
With satellites guiding us through day and night.
A journey of code, where insights bloom,
In the world of spatial, there's always more room! 🌌




codecov bot commented Oct 10, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 27.04%. Comparing base (805d76a) to head (d7acd3d).
Report is 30 commits behind head on staging.

Additional details and impacted files


@@           Coverage Diff            @@
##           staging    #3613   +/-   ##
========================================
  Coverage    27.04%   27.04%           
========================================
  Files          146      146           
  Lines        21339    21339           
  Branches       273      273           
========================================
  Hits          5772     5772           
  Misses       15567    15567           

    except Exception as e:
        return jsonify({'error': str(e)}), 500

Check warning

Code scanning / CodeQL

Information exposure through an exception — Medium

Stack trace information flows to this location and may be exposed to an external user.

Copilot Autofix AI 4 months ago

To fix the problem, we need to ensure that detailed error information is not exposed to the end user. Instead, we should log the error details on the server and return a generic error message to the user. This can be achieved by using Python's logging module to log the exception details and then returning a generic error message in the response.

Suggested changeset 1
src/spatial/views/satellite_predictions.py

Autofix patch
Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/src/spatial/views/satellite_predictions.py b/src/spatial/views/satellite_predictions.py
--- a/src/spatial/views/satellite_predictions.py
+++ b/src/spatial/views/satellite_predictions.py
@@ -4,2 +4,3 @@
 from flask import request, jsonify
+import logging
 
@@ -9,2 +10,4 @@
 
+logging.basicConfig(level=logging.ERROR)
+
 class SatellitePredictionView:
@@ -50,3 +53,4 @@
         except Exception as e:
-            return jsonify({'error': str(e)}), 500
+            app.logger.error('An error occurred: %s', str(e))
+            return jsonify({'error': 'An internal error has occurred!'}), 500
 
EOF

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 11

🧹 Outside diff range and nitpick comments (5)
src/spatial/controllers/controllers.py (1)

51-53: Well-structured new route for satellite predictions.

The new route is correctly implemented and follows the existing patterns in the file. Great job on maintaining consistency! A couple of suggestions to consider:

  1. It might be beneficial to add some basic error handling or input validation in this controller function. This could help catch and handle any potential issues before they reach the view layer.

  2. Consider adding a docstring to the get_satellite_prediction function to describe its purpose and any expected request parameters.

Here's a potential enhancement to consider:

@controller_bp.route('/satellite_prediction', methods=['POST'])
def get_satellite_prediction():
    """
    Handle POST requests for satellite predictions.
    
    Expected request format:
    {
        // Add expected request parameters here
    }
    
    Returns:
        JSON response with prediction results or error message.
    """
    try:
        return SatellitePredictionView.make_predictions()
    except Exception as e:
        return jsonify({"error": str(e)}), 400

This addition provides documentation and basic error handling. Adjust the docstring and error handling as needed for your specific use case.

src/spatial/views/satellite_predictions.py (1)

33-33: Unnecessary f-string: Remove f Prefix

The string "satellite_model.pkl" does not contain any placeholders, so the f prefix is unnecessary.

Apply this diff to remove the unnecessary f prefix:

-     project_name, bucket_name, f"satellite_model.pkl"
+     project_name, bucket_name, "satellite_model.pkl"
🧰 Tools
🪛 Ruff

33-33: Undefined name project_name

(F821)


33-33: Undefined name bucket_name

(F821)


33-33: f-string without any placeholders

Remove extraneous f prefix

(F541)

src/spatial/models/SatellitePredictionModel.py (2)

41-46: Double-check the calculation of the week number

The week number is calculated using strftime('%V'), which follows the ISO 8601 standard. Depending on your application's requirements, this may or may not align with your expected week numbering system.

If you need the week number as per the Gregorian calendar, you might use:

 all_features['week'] = int(current_time.strftime('%V'))  # ISO 8601 week number
+ # Alternatively, for Gregorian calendar week number:
+ # all_features['week'] = current_time.isocalendar()[1]
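For reference, the ISO ('%V') and Sunday-first ('%U') conventions can disagree at year boundaries; a quick standard-library check makes the difference concrete:

```python
from datetime import datetime, timezone

# 1 Jan 2021 is a Friday: it belongs to ISO week 53 of 2020,
# but to week 0 of 2021 under the Sunday-first (%U) convention.
d = datetime(2021, 1, 1, tzinfo=timezone.utc)
iso_week = int(d.strftime('%V'))  # ISO 8601 week number
us_week = int(d.strftime('%U'))   # Sunday-first week number
print(iso_week, us_week)  # 53 0
```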

4-5: Reorganize imports according to PEP 8 guidelines

PEP 8 recommends grouping imports into three sections: standard library imports, third-party imports, and local application imports, separated by blank lines. This enhances readability.

You can reorganize the imports as follows:

 from datetime import datetime, timezone
 from typing import Dict

+import ee
+from google.oauth2 import service_account

 from configure import Config, satellite_collections

Also applies to: 7-7

src/spatial/configure.py (1)

140-140: Remove unnecessary listing of bucket contents

On line 140, the call to fs.ls(bucket_name) lists the contents of the bucket but doesn't use the result. Removing this line can reduce unnecessary overhead and improve performance.

Here's the adjusted code:

 def get_trained_model_from_gcs(project_name, bucket_name, source_blob_name):
     fs = gcsfs.GCSFileSystem(project=project_name)
-    fs.ls(bucket_name)
     with fs.open(bucket_name + "/" + source_blob_name, "rb") as handle:
         job = joblib.load(handle)
     return job
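The open-a-handle-then-deserialize pattern used by get_trained_model_from_gcs can be exercised locally without GCS. The sketch below is an illustration only: it uses pickle from the standard library as a stand-in for joblib, and a temporary local file as a stand-in for the bucket blob.

```python
import os
import pickle
import tempfile

def load_model_from_path(path):
    # Same shape as get_trained_model_from_gcs: open a binary handle,
    # deserialize, and return the object (pickle standing in for joblib).
    with open(path, "rb") as handle:
        return pickle.load(handle)

# Round-trip a stand-in "model" object through a temporary file.
fd, path = tempfile.mkstemp(suffix=".pkl")
with os.fdopen(fd, "wb") as f:
    pickle.dump({"kind": "stand-in model"}, f)
model = load_model_from_path(path)
os.remove(path)
```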
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR, between commits 11cd411 and 897348c.

📒 Files selected for processing (4)
  • src/spatial/configure.py (2 hunks)
  • src/spatial/controllers/controllers.py (2 hunks)
  • src/spatial/models/SatellitePredictionModel.py (1 hunks)
  • src/spatial/views/satellite_predictions.py (1 hunks)
🧰 Additional context used
🪛 Ruff
src/spatial/views/satellite_predictions.py

3-3: gcsfs imported but unused

Remove unused import: gcsfs

(F401)


4-4: joblib imported but unused

Remove unused import: joblib

(F401)


33-33: Undefined name project_name

(F821)


33-33: Undefined name bucket_name

(F821)


33-33: f-string without any placeholders

Remove extraneous f prefix

(F541)

🪛 GitHub Check: CodeQL
src/spatial/views/satellite_predictions.py

[warning] 44-44: Information exposure through an exception
Stack trace information flows to this location and may be exposed to an external user.

🔇 Additional comments (4)
src/spatial/controllers/controllers.py (1)

7-7: Excellent addition of the SatellitePredictionView import.

The new import statement is well-placed and follows the existing naming conventions. It's a necessary addition to support the new satellite prediction functionality.

src/spatial/models/SatellitePredictionModel.py (1)

7-7: Verify the availability of the 'configure' module

The import statement from configure import Config, satellite_collections assumes the presence of a configure module containing Config and satellite_collections. Please ensure that this module exists in your project and is correctly set up.

To confirm the existence of configure.py, you can run:

✅ Verification successful

Re-attempting to Verify the 'configure' Module

Let's try a corrected command to locate configure.py:


Verifying Definitions in configure.py Modules

Let's identify which configure.py contains the Config class and satellite_collections:


'configure' Module Verification Successful

The configure module is present in src/spatial/configure.py and contains both Config and satellite_collections. The import statement in SatellitePredictionModel.py is correctly referencing this module.

  • Module Path: src/spatial/configure.py
  • Contains:
    • Config class
    • satellite_collections variable
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check for the presence of 'configure.py' in the project directory.

# Test: Search for 'configure.py'. Expected output: Path to 'configure.py'.
fd --type f --name 'configure.py'

Length of output: 284


Script:

#!/bin/bash
# Description: Check for the presence of 'configure.py' in the project directory.

# Test: Search for 'configure.py'. Expected output: Path to 'configure.py'.
fd -t f 'configure.py'

Length of output: 104


Script:

#!/bin/bash
# Description: Verify that 'Config' and 'satellite_collections' are defined in the correct 'configure.py'.

# Search for the 'Config' class
grep -H "class Config" src/predict/jobs/predict_places_air_quality/configure.py src/spatial/configure.py

# Search for the 'satellite_collections' variable
grep -H "satellite_collections" src/predict/jobs/predict_places_air_quality/configure.py src/spatial/configure.py

Length of output: 381

src/spatial/configure.py (2)

47-47: Using fallback configuration enhances stability

Assigning configuration = app_config.get(environment, "staging") ensures that we have a default configuration if the specified environment is not found. This is a good approach to maintain application stability.


49-135: Verify the correctness of satellite data fields

It's important to ensure that all parameters within the satellite_collections dictionary are accurate and correspond to the actual data fields provided by the satellite data sources. Please double-check these entries to prevent any potential data retrieval issues.

To assist with verification, you can run the following script to cross-reference the fields with the available datasets:

Please adjust the script according to the tools available in our environment.

Comment on lines 32 to 34
prediction = get_trained_model_from_gcs(
project_name, bucket_name, f"satellite_model.pkl"
).predict(feature_array)[0]

⚠️ Potential issue

Undefined Variables: project_name and bucket_name

The variables project_name and bucket_name used in get_trained_model_from_gcs are not defined within the scope of this function. Ensure that these variables are properly defined or passed as parameters to avoid NameError exceptions.

Would you like assistance in defining these variables or retrieving them from your configuration?
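One way to resolve this is to read both values from the module's configuration; a later review comment confirms the eventual code uses Config.GOOGLE_CLOUD_PROJECT_ID and Config.PROJECT_BUCKET. The sketch below uses a stand-in Config class with placeholder values; the real class in src/spatial/configure.py populates these from the environment.

```python
class Config:
    # Stand-in for the Config class in src/spatial/configure.py;
    # placeholder values, not the project's real identifiers.
    GOOGLE_CLOUD_PROJECT_ID = "example-project"
    PROJECT_BUCKET = "example-bucket"

def resolve_model_location():
    # Resolve the previously undefined names from configuration
    # instead of relying on module-level globals.
    project_name = Config.GOOGLE_CLOUD_PROJECT_ID
    bucket_name = Config.PROJECT_BUCKET
    return project_name, bucket_name, "satellite_model.pkl"
```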


Comment on lines 3 to 4
import gcsfs
import joblib

⚠️ Potential issue

Remove Unused Imports: gcsfs and joblib

It appears that gcsfs and joblib are imported but not used in this file. Removing unused imports helps keep the code clean and maintainable.

Apply this diff to remove the unused imports:

- import gcsfs
- import joblib

Comment on lines 25 to 30
for collection, fields in satellite_collections.items():
    image = ee.ImageCollection(collection) \
        .filterDate(current_time.strftime('%Y-%m-%d')) \
        .filterBounds(aoi) \
        .first()


⚠️ Potential issue

Ensure proper handling when no images are found in the collection

When filtering the ImageCollection, there may be cases where no images match the criteria, resulting in image being None. It's important to handle this scenario to avoid unexpected errors downstream.

You might consider adding a check to continue the loop if no image is found:

 for collection, fields in satellite_collections.items():
     image = ee.ImageCollection(collection) \
         .filterDate(current_time.strftime('%Y-%m-%d')) \
         .filterBounds(aoi) \
         .first()

+    if not image:
+        print(f"No images found for collection {collection} on {current_time.strftime('%Y-%m-%d')}")
+        continue

Comment on lines 38 to 39
for field in fields:
    all_features[f"{collection}_{field}"] = values.get(field)

⚠️ Potential issue

Handle cases where 'values' may be empty or missing keys

There may be situations where values is empty or doesn't contain all the expected fields. To prevent issues, it's prudent to handle missing keys when populating all_features.

Consider using a default value when accessing fields:

 for field in fields:
-    all_features[f"{collection}_{field}"] = values.get(field)
+    all_features[f"{collection}_{field}"] = values.get(field, None)

Comment on lines 31 to 37
if image:
    values = image.reduceRegion(
        reducer=ee.Reducer.mean(),
        geometry=aoi,
        scale=1113.2
    ).getInfo()


⚠️ Potential issue

Handle potential exceptions during data retrieval with 'getInfo()'

The getInfo() call communicates with the Earth Engine servers and may raise exceptions due to network issues or server errors. To enhance robustness, consider wrapping this call in a try-except block.

Here's how you might implement this:

 if image:
     try:
         values = image.reduceRegion(
             reducer=ee.Reducer.mean(),
             geometry=aoi,
             scale=1113.2
         ).getInfo()
+    except Exception as e:
+        print(f"Error retrieving data from collection {collection}: {e}")
+        continue

Comment on lines 18 to 48
@staticmethod
def extract_single_point_data(longitude: float, latitude: float) -> Dict[str, float]:
    aoi = ee.Geometry.Point([longitude, latitude])
    current_time = datetime.now(timezone.utc)

    all_features = {}

    for collection, fields in satellite_collections.items():
        image = ee.ImageCollection(collection) \
            .filterDate(current_time.strftime('%Y-%m-%d')) \
            .filterBounds(aoi) \
            .first()

        if image:
            values = image.reduceRegion(
                reducer=ee.Reducer.mean(),
                geometry=aoi,
                scale=1113.2
            ).getInfo()

            for field in fields:
                all_features[f"{collection}_{field}"] = values.get(field)

    # Add time-related features
    all_features['year'] = current_time.year
    all_features['month'] = current_time.month
    all_features['day'] = current_time.day
    all_features['dayofweek'] = current_time.weekday()
    all_features['week'] = int(current_time.strftime('%V'))

    return all_features

🛠️ Refactor suggestion

Enhance code clarity with detailed docstrings and type annotations

Adding comprehensive docstrings and precise type hints improves code readability and assists other developers in understanding the code's purpose and usage.

Here's how you can augment the method:

 @staticmethod
-def extract_single_point_data(longitude: float, latitude: float) -> Dict[str, float]:
+def extract_single_point_data(longitude: float, latitude: float) -> Dict[str, Optional[float]]:
+    """
+    Extract satellite data features for a given geographic point.
+
+    Parameters:
+        longitude (float): The longitude of the point of interest.
+        latitude (float): The latitude of the point of interest.
+
+    Returns:
+        Dict[str, Optional[float]]: A dictionary containing satellite data features and time-related features.
+    """

src/spatial/configure.py — 3 additional review comments (resolved; one marked outdated)
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (10)
src/spatial/requirements.txt (1)

11-18: Excellent additions for API and cloud integration!

The inclusion of flask-cors, earthengine-api, overpy, and geopy with specific version constraints is a commendable step towards robust API development and cloud integration. These additions align perfectly with the PR's objective of setting up API endpoints for satellite predictions.

However, I noticed that the existing entries for google-cloud-bigquery, google-cloud-storage, and gunicorn lack version constraints. For consistency and to prevent potential compatibility issues in the future, consider adding version constraints to these packages as well.

Here's a suggested update for the Google Cloud packages and gunicorn:

-google-cloud-bigquery
-google-cloud-storage
-google-cloud-bigquery[pandas] 
-gunicorn
+google-cloud-bigquery~=3.19.0
+google-cloud-storage~=2.16.0
+google-cloud-bigquery[pandas]~=3.19.0
+gunicorn~=21.2.0

Please verify these versions against your current setup before implementing.
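As a reminder of the pinning semantics used above, `~=` is the compatible-release operator from PEP 440: `gunicorn~=21.2.0` means `>=21.2.0, ==21.2.*`, so patch upgrades are allowed while minor and major upgrades are not. For example, these requirement lines are equivalent:

```text
gunicorn~=21.2.0
gunicorn>=21.2.0, ==21.2.*
```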

src/spatial/views/satellite_predictions.py (3)

16-23: Request parsing and validation look solid!

The code effectively extracts the required parameters (latitude and longitude) and the optional city parameter from the JSON request. The validation check for latitude and longitude is appropriate, ensuring that these required fields are present before proceeding.

As a minor suggestion, consider adding a validation check for the city parameter if there are any specific requirements (e.g., non-empty string, list of valid cities). This would depend on your application's needs and could enhance data integrity.
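If such a check is added, it could look like the sketch below. This is an illustration only: the non-empty-string rule is a hypothetical policy, not something this PR implements.

```python
def validate_city(city):
    """Optional-parameter check: None is allowed, but if a city is
    supplied it must be a non-empty string (hypothetical rule)."""
    if city is None:
        return None
    if not isinstance(city, str) or not city.strip():
        raise ValueError("city must be a non-empty string when provided")
    return city.strip()
```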


25-33: Earth Engine initialization and data extraction look great!

The code effectively initializes the Earth Engine, creates a well-structured location dictionary, and extracts the necessary data for the given location. The conversion of features to a numpy array is also handled appropriately.

As a minor suggestion, consider adding error handling around the Earth Engine initialization and data extraction steps. This could help catch and handle any potential issues specific to these operations, improving the robustness of the code.
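A sketch of that wrapping is shown below, with plain callables standing in for the Earth Engine steps (the real code would call SatellitePredictionModel.initialize_earth_engine() and the data-extraction method); the error messages and return shape are illustrative assumptions.

```python
def make_predictions_safely(initialize, extract, longitude, latitude):
    # initialize/extract stand in for the Earth Engine init and
    # data-extraction steps; each failure mode gets its own message.
    try:
        initialize()
    except Exception as e:
        return {"error": f"Earth Engine initialization failed: {e}"}, 500
    try:
        features = extract(longitude, latitude)
    except Exception as e:
        return {"error": f"data extraction failed: {e}"}, 500
    return {"features": features}, 200
```

In the Flask view, the returned dict would be passed through jsonify before being sent to the client.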


35-45: Model retrieval, prediction, and response formatting look excellent!

The code effectively retrieves the trained model from Google Cloud Storage, makes a prediction using the feature array, and formats the response as a JSON object with all the relevant information. This is a well-structured approach to handling the prediction process.

There's a minor optimization we can make:

On line 36, the f-string doesn't contain any placeholders. We can simplify it by removing the f prefix:

Config.GOOGLE_CLOUD_PROJECT_ID, Config.PROJECT_BUCKET, "satellite_prediction_model.pkl"

This small change will make the code slightly more efficient and cleaner.

🧰 Tools
🪛 Ruff

36-36: f-string without any placeholders

Remove extraneous f prefix

(F541)

src/spatial/models/SatellitePredictionModel.py (3)

12-16: Consider adding exception handling to Earth Engine initialization

The initialize_earth_engine method looks good overall. However, to enhance robustness, it would be beneficial to add exception handling. This can help manage potential issues gracefully, such as missing or invalid credential files.

Here's a suggested modification:

@staticmethod
def initialize_earth_engine():
    try:
        credentials = service_account.Credentials.from_service_account_file(
            Config.CREDENTIALS,
            scopes=['https://www.googleapis.com/auth/earthengine']
        )
        ee.Initialize(credentials=credentials, project=Config.GOOGLE_CLOUD_PROJECT_ID)
    except Exception as e:
        print(f"Failed to initialize Earth Engine: {e}")
        raise

This change will catch any exceptions during initialization, log the error, and re-raise it for proper handling by the caller.


18-32: Well-structured methods with good use of Earth Engine API

The extract_data_for_image and get_satellite_data methods are well-implemented, making good use of Earth Engine API and functional programming principles. The use of type hints is commendable, enhancing code readability and maintainability.

Consider making the scale parameter in extract_data_for_image configurable:

@staticmethod
def extract_data_for_image(image: ee.Image, aoi: ee.Geometry.Point, scale: float = 1113.2) -> ee.Feature:
    return ee.Feature(None, image.reduceRegion(
        reducer=ee.Reducer.mean(),
        geometry=aoi,
        scale=scale
    ).set('date', image.date().format('YYYY-MM-dd')))

This change allows for more flexibility in different use cases where a different scale might be needed.


34-61: Comprehensive method with good error handling

The extract_data_for_location method is well-implemented, handling various scenarios including cases where no data is available. The addition of time-related features is a thoughtful touch that will be valuable for data analysis.

Consider adding some inline comments to explain the logic, especially for the data extraction and time feature addition. This will make the code even more readable and maintainable. For example:

# Extract latest satellite data for each collection
for collection, fields in collections.items():
    satellite_data = SatellitePredictionModel.get_satellite_data(aoi, collection, fields)
    time_series = satellite_data.getInfo()

    # Use the latest available data or set to 0 if no data
    if time_series['features']:
        latest_feature = time_series['features'][-1]['properties']
        for field in fields:
            field_key = f"{collection}_{field}"
            result[field_key] = latest_feature.get(field, 0) or 0
    else:
        for field in fields:
            field_key = f"{collection}_{field}"
            result[field_key] = 0

# Add time-related features for temporal analysis
current_time = datetime.now(timezone.utc)
result['year'] = current_time.year
result['month'] = current_time.month
result['day'] = current_time.day
result['dayofweek'] = current_time.weekday()
result['week'] = int(current_time.strftime('%V'))

These comments provide context for each section of the method, making it easier for other developers to understand and maintain the code.

src/spatial/configure.py (3)

20-20: Well-integrated PROJECT_BUCKET variable

The addition of the PROJECT_BUCKET variable seamlessly integrates with the existing configuration structure. It's a prudent choice for managing cloud storage references.

Consider adding a brief comment to explain the purpose of this variable, enhancing code readability:

# Google Cloud Storage bucket for project assets
PROJECT_BUCKET = os.getenv("PROJECT_BUCKET")
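If a missing bucket name should fail fast rather than surface later as a `None`, a small hypothetical helper (not part of this PR) could wrap the lookup:

```python
import os

def require_env(name):
    """Fetch a required environment variable, failing fast when it is
    unset or empty instead of letting a None propagate downstream."""
    value = os.getenv(name)
    if not value:
        raise RuntimeError(f"{name} environment variable is not set")
    return value
```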

48-134: Comprehensive satellite data mapping

The satellite_collections dictionary provides a well-structured and extensive mapping of satellite data collections. This implementation offers strong type checking and IDE support, which can be beneficial for development.

While externalizing this configuration (as suggested in a previous comment) remains a valid option for improved maintainability, the current implementation has its merits. To further enhance maintainability and type safety, consider adding type hints:

from typing import Dict, List

satellite_collections: Dict[str, List[str]] = {
    'COPERNICUS/S5P/OFFL/L3_SO2': [
        'SO2_column_number_density',
        # ... other fields ...
    ],
    # ... other collections ...
}

This addition would provide even stronger type checking and improve code documentation.


137-142: Efficient model retrieval from GCS

The get_trained_model_from_gcs function is a valuable addition for retrieving trained models from Google Cloud Storage. To further enhance its robustness and clarity, consider the following suggestions:

  1. Implement exception handling as mentioned in the previous comment to ensure graceful error management.

  2. Add type hints and a docstring for improved clarity and maintainability:

from typing import Any
import gcsfs
import joblib

def get_trained_model_from_gcs(project_name: str, bucket_name: str, source_blob_name: str) -> Any:
    """
    Retrieve a trained model from Google Cloud Storage.

    Args:
        project_name (str): The name of the Google Cloud project.
        bucket_name (str): The name of the GCS bucket.
        source_blob_name (str): The path to the model file within the bucket.

    Returns:
        Any: The loaded model object.

    Raises:
        Exception: If there's an error in accessing or loading the model.
    """
    fs = gcsfs.GCSFileSystem(project=project_name)
    try:
        with fs.open(f"{bucket_name}/{source_blob_name}", "rb") as handle:
            return joblib.load(handle)
    except Exception as e:
        print(f"Error loading model from GCS: {e}")
        raise

These enhancements will significantly improve the function's robustness and usability.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between 897348c and 30d3a0d.

📒 Files selected for processing (4)
  • src/spatial/configure.py (3 hunks)
  • src/spatial/models/SatellitePredictionModel.py (1 hunks)
  • src/spatial/requirements.txt (1 hunks)
  • src/spatial/views/satellite_predictions.py (1 hunks)
🧰 Additional context used
🪛 Ruff
src/spatial/views/satellite_predictions.py

36-36: f-string without any placeholders

Remove extraneous f prefix

(F541)

🪛 GitHub Check: CodeQL
src/spatial/views/satellite_predictions.py

[warning] 48-48: Information exposure through an exception
Stack trace information flows to this location and may be exposed to an external user.

🔇 Additional comments (7)
src/spatial/requirements.txt (3)

6-10: Impressive enhancement of geospatial and data processing capabilities!

The updates to pysal, geojson, libpysal, requests, and python-dotenv, along with the addition of numpy, protobuf, esda, shapely, scikit-learn, gcsfs, and joblib, significantly bolster our geospatial and data processing capabilities. These changes align perfectly with the PR objectives for satellite predictions and data processing.

The inclusion of lightgbm is particularly intriguing. While it's an excellent choice for gradient boosting, I'm curious about its specific application in our satellite prediction models.

Could you kindly elaborate on how lightgbm is being utilized in our satellite prediction workflow? This information would be valuable for the team's understanding and future maintenance.

Also applies to: 19-26


1-26: Comprehensive and well-structured dependency management!

This update to the requirements.txt file is a significant step forward in aligning our dependencies with the PR objectives of setting up backend code for satellite predictions. The careful version management, inclusion of essential geospatial and machine learning libraries, and enhancement of API and cloud integration capabilities are all commendable.

The changes demonstrate a thoughtful approach to dependency management, balancing the need for specific versions with the flexibility for minor updates. This will contribute to a more stable and maintainable codebase.

A few key points to highlight:

  1. Core dependencies (Flask, pandas, geopandas) have been updated with appropriate version constraints.
  2. New geospatial and machine learning libraries have been added, enhancing our data processing capabilities.
  3. API and cloud integration tools have been included, supporting the implementation of new endpoints.

Overall, these changes provide a solid foundation for the satellite prediction functionality we're aiming to implement. Great work on setting up a robust environment for this feature!


2-4: Excellent update to core dependencies!

The version constraints for Flask, pandas, and geopandas have been thoughtfully updated using the ~= operator. This approach ensures compatibility with the specified versions while allowing for minor updates, which is a prudent practice in dependency management.

To ensure these updates don't introduce any breaking changes, please verify dependency compatibility against the current codebase before merging.

src/spatial/views/satellite_predictions.py (1)

1-15: Imports and class definition look good!

The imports are relevant to the functionality, and the class SatellitePredictionView is properly defined with a static method. It's great to see that the previously mentioned unused imports have been removed, keeping the code clean and maintainable.

src/spatial/models/SatellitePredictionModel.py (1)

1-10: Imports and class definition look good!

The imports are well-organized and appropriate for the functionality of the SatellitePredictionModel class. The class name is descriptive and follows the PascalCase convention, which is excellent.

src/spatial/configure.py (2)

5-6: Excellent addition of cloud storage capabilities!

The inclusion of gcsfs and joblib libraries sets the stage for robust cloud storage interactions and efficient model serialization. These are solid choices for enhancing our spatial data processing capabilities.


Line range hint 1-142: Excellent enhancements to spatial configuration

The changes to src/spatial/configure.py significantly improve the project's capabilities, particularly in cloud storage integration and satellite data management. The additions are well-structured and align perfectly with the project's objectives.

Key improvements include:

  1. Integration of Google Cloud Storage functionality.
  2. Comprehensive mapping of satellite data collections.
  3. Utility function for retrieving trained models from GCS.

The suggested minor enhancements (adding comments, type hints, and exception handling) will further elevate the code's robustness and maintainability. Overall, these changes lay a solid foundation for advanced spatial data processing and analysis.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (2)
src/spatial/views/satellite_predictions.py (2)

15-23: Great implementation of request processing!

The method signature update to handle the optional 'city' parameter aligns well with the PR objectives. The extraction of latitude, longitude, and city from the request JSON is done efficiently, and the error handling for missing coordinates is appropriate.

As a minor suggestion, consider adding a validation check for the data type of latitude and longitude to ensure they are numeric values. This could prevent potential issues down the line.

Would you like me to provide an example of how to implement this additional validation?
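One possible shape for that validation — the function name and range checks are illustrative, not from the PR:

```python
def validate_coordinates(latitude, longitude):
    """Coerce latitude/longitude to floats and range-check them.
    Raises ValueError on non-numeric or out-of-range input."""
    try:
        lat, lon = float(latitude), float(longitude)
    except (TypeError, ValueError):
        raise ValueError("latitude and longitude must be numeric")
    if not -90.0 <= lat <= 90.0:
        raise ValueError("latitude must be in [-90, 90]")
    if not -180.0 <= lon <= 180.0:
        raise ValueError("longitude must be in [-180, 180]")
    return lat, lon
```

The view could call this before touching Earth Engine and translate the `ValueError` into a 400 response.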


37-48: Excellent implementation of model retrieval and prediction!

The process of retrieving the trained model from Google Cloud Storage and making predictions is well-implemented. The response formatting is comprehensive, including all necessary information such as the prediction, coordinates, and timestamp.

I noticed a minor optimization opportunity:

On line 39, there's an f-string being used without any placeholders. We can simplify this by removing the f prefix:

Config.GOOGLE_CLOUD_PROJECT_ID, Config.PROJECT_BUCKET, "satellite_prediction_model.pkl"

This small change will make the code slightly more efficient and cleaner.

Would you like me to provide a diff for this change?

🧰 Tools
🪛 Ruff

39-39: f-string without any placeholders

Remove extraneous f prefix

(F541)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between 30d3a0d and c7e9640.

📒 Files selected for processing (1)
  • src/spatial/views/satellite_predictions.py (1 hunks)
🧰 Additional context used
🪛 Ruff
src/spatial/views/satellite_predictions.py

39-39: f-string without any placeholders

Remove extraneous f prefix

(F541)

🔇 Additional comments (2)
src/spatial/views/satellite_predictions.py (2)

1-10: Imports and class definition look good!

The imports are relevant to the functionality, and the class definition is straightforward. It's great to see that the previously unused imports (gcsfs and joblib) have been removed, keeping the code clean and maintainable.


25-35: Earth Engine initialization and data extraction look solid!

The code efficiently initializes the Earth Engine and extracts the necessary data for the given location. The feature preparation for prediction is well-implemented.

I noticed that a previous comment mentioned potential undefined variables (project_name and bucket_name). It appears this issue has been resolved, as these variables are not present in the current implementation. Great job addressing that!

To ensure the issue with undefined variables has been completely resolved, let's run a quick check:

This will help confirm that these variables are either properly defined or no longer used in this file.

✅ Verification successful

Updating Verification Script for Correct File Type Recognition

It appears there was an issue with the previous script regarding the file type specification. Let's correct the rg command to use the appropriate file type identifier for Python files.


Undefined Variables Successfully Resolved

The verification confirms that project_name and bucket_name are no longer present in src/spatial/views/satellite_predictions.py. This addresses and resolves the previously raised concern regarding undefined variables.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Verify that project_name and bucket_name are properly defined or not used

rg --type python 'project_name|bucket_name' src/spatial/views/satellite_predictions.py

Length of output: 119


Script:

#!/bin/bash
# Verify that project_name and bucket_name are properly defined or not used

rg --type py 'project_name|bucket_name' src/spatial/views/satellite_predictions.py

Length of output: 84

Mnoble-19 and others added 2 commits October 11, 2024 10:35
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (1)
src/spatial/configure.py (1)

137-145: Excellent implementation of exception handling in get_trained_model_from_gcs

Hello there! I'm delighted to see the implementation of exception handling in the get_trained_model_from_gcs function. This is a significant improvement in robustness and error management. Well done!

A couple of suggestions to further enhance this function:

  1. Consider using a logging library instead of print statements for error reporting. This would be more suitable for a production environment and provide more control over log levels and output destinations.

  2. You might want to add a docstring to explain the function's purpose, parameters, and return value, including the fact that it can return None in case of an error.

Here's a quick example of how you might implement these suggestions:

import logging

def get_trained_model_from_gcs(project_name, bucket_name, source_blob_name):
    """
    Retrieve a trained model from Google Cloud Storage.

    Args:
        project_name (str): The name of the GCP project.
        bucket_name (str): The name of the GCS bucket.
        source_blob_name (str): The name of the blob (file) in the bucket.

    Returns:
        object: The loaded model, or None if an error occurred.
    """
    fs = gcsfs.GCSFileSystem(project=project_name)
    try:
        with fs.open(f"{bucket_name}/{source_blob_name}", "rb") as handle:
            return joblib.load(handle)
    except Exception as e:
        logging.error(f"Error loading model from GCS: {e}")
        return None

What do you think about these suggestions? Would you like any help implementing them?

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between c7e9640 and 687379c.

📒 Files selected for processing (1)
  • src/spatial/configure.py (3 hunks)
🧰 Additional context used
🔇 Additional comments (2)
src/spatial/configure.py (2)

5-6: Ensure 'gcsfs' and 'joblib' are included in project dependencies

Hello there! I see you've added imports for gcsfs and joblib. These are excellent choices for working with Google Cloud Storage and serializing Python objects. To ensure smooth sailing in all environments, it's crucial to add these libraries to your project's requirements.txt or setup.py. This will help prevent any unexpected issues during deployment or when other team members run the code.


48-134: 🛠️ Refactor suggestion

Consider externalizing 'satellite_collections' for improved maintainability

Greetings! I must say, I'm impressed by the comprehensive satellite_collections dictionary you've crafted. It's a treasure trove of information about various satellite datasets and their attributes. Well done!

However, I do concur with the previous suggestion about externalizing this configuration. Given its size and the likelihood of future updates, moving it to a separate JSON or YAML file could offer several benefits:

  1. Improved readability of the main Python file
  2. Easier updates without touching the core code
  3. Potential for dynamic loading or hot-reloading of configurations

That said, I understand if there are reasons to keep it inline. If you decide to keep it here, consider adding a comment explaining why this approach was chosen.

Would you like any assistance in externalizing this configuration or adding explanatory comments?
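As a sketch of the externalized approach — the file name and loader below are hypothetical, not part of this PR:

```python
import json

def load_satellite_collections(path):
    """Load the collection -> band-fields mapping from an external JSON
    file and check its basic shape before returning it."""
    with open(path, "r", encoding="utf-8") as fh:
        data = json.load(fh)
    # Every value must be a list of field names, mirroring the inline dict.
    for collection, fields in data.items():
        if not isinstance(fields, list):
            raise ValueError(f"{collection}: expected a list of field names")
    return data
```

The configure module would then call this once at startup instead of embedding the 80-line dictionary inline.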

Contributor

Spatial changes in this PR available for preview here

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)
src/spatial/Dockerfile (3)

Line range hint 9-12: Well-structured dependency management!

The approach to installing dependencies is clean and follows best practices. Upgrading pip, setuptools, and wheel before installing other dependencies is a smart move that can prevent potential issues.

Consider combining the two RUN commands to reduce the number of layers in the Docker image:

-RUN pip install --upgrade pip setuptools wheel
-RUN pip3 install -r requirements.txt
+RUN pip install --upgrade pip setuptools wheel && \
+    pip3 install -r requirements.txt

This change can slightly reduce the image size and build time.


Line range hint 18-20: Solid Flask app configuration!

The setup for the Flask application follows standard practices, making it clear and easy to understand.

To enhance flexibility, consider using an environment variable for the port number:

 ENV FLASK_APP=app.py
-EXPOSE 5000
+ARG PORT=5000
+ENV PORT=$PORT
+EXPOSE $PORT

This change allows for easier configuration of the port number at build or run time, which can be particularly useful in different deployment scenarios.


Line range hint 22-31: Excellent use of multi-stage builds!

The implementation of separate stages for development, staging, and production environments is a great practice. It allows for environment-specific configurations and optimizations.

To reduce redundancy and improve maintainability, consider consolidating the staging and production stages:

 FROM base as dev
 ENV FLASK_ENV=development
 CMD flask run --host=0.0.0.0

-FROM base as staging
-ENV FLASK_ENV=staging
-CMD gunicorn --bind=0.0.0.0:5000 app:app
-
-FROM base as production
-ENV FLASK_ENV=production
-CMD gunicorn --bind=0.0.0.0:5000 app:app
+FROM base as deploy
+ARG FLASK_ENV=production
+ENV FLASK_ENV=$FLASK_ENV
+CMD gunicorn --bind=0.0.0.0:$PORT app:app

This change allows you to use the same stage for both staging and production, differentiating them by passing the appropriate FLASK_ENV value at build time. It also uses the PORT environment variable we suggested earlier.

You can then build the images like this:

docker build --target=deploy --build-arg FLASK_ENV=staging -t myapp:staging .
docker build --target=deploy --build-arg FLASK_ENV=production -t myapp:production .

This approach reduces duplication and makes it easier to maintain consistent configurations between staging and production environments.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between 687379c and d7acd3d.

📒 Files selected for processing (2)
  • src/spatial/Dockerfile (1 hunks)
  • src/spatial/requirements.txt (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • src/spatial/requirements.txt
🧰 Additional context used
🔇 Additional comments (1)
src/spatial/Dockerfile (1)

2-2: Excellent choice on updating the base image!

The upgrade to python:3.10-slim-bullseye is a positive change that brings both a newer Python version and a more recent Debian distribution. This update can potentially improve performance and security.

To ensure smooth operation with the new base image, please verify the compatibility of all dependencies with Python 3.10 before merging.

@Baalmart Baalmart merged commit 99f316a into staging Oct 11, 2024
50 checks passed
@Baalmart Baalmart deleted the satellite-endpoints branch October 11, 2024 08:20
@Baalmart Baalmart mentioned this pull request Oct 11, 2024
8 tasks