Information extraction from Cheshire fire incident reports using Mistral #2201

Merged

priyankatuteja merged 1 commit into next from NER_Using_Mistral

Jan 31, 2025

Collaborator

SurajBaloni commented Jan 17, 2025

This PR adds a sample notebook: Information extraction from Cheshire fire incident reports using the Mistral language model.

Table of Contents location:
Deep Learning > NLP > Information extraction from Cheshire fire incident reports using Mistral


Information extraction from Cheshire fire incident reports using Mistral (commit 951bfe4)

jyaistMap mentioned this pull request

Information extraction from Cheshire fire incident reports using Mistral #2115

Open

SurajBaloni requested review from priyankatuteja, kapil-varshney and BP-Ent

January 28, 2025 04:57

Collaborator Author

SurajBaloni commented Jan 28, 2025

@BP-Ent Please review this sample notebook

jyaistMap approved these changes

Collaborator

jyaistMap left a comment

Looks good visually from my review, but I cannot speak to technical accuracy.
The local build of the doc appears like this for half of the doc; the rest of it looks good.
The right-hand navigation all works.

Collaborator

jyaistMap commented Jan 28, 2025

@priyankatuteja

When you approve this, I can merge python-api-doc #288 to update the Table of Contents to reflect the new sample.

BP-Ent reviewed

...nformation_extraction_from_cheshire_fire_incident_reports_using_mistral_language_model.ipynb

    
@@ -0,0 +1,1208 @@
{

Collaborator

BP-Ent Jan 28, 2025

Refer to the section Install deep learning dependencies of the arcgis.learn module for detailed documentation on installing the dependencies.

To learn more about how EntityRecognizer works, please refer to the guide on Named Entity Extraction with arcgis.learn.

Collaborator

BP-Ent Jan 28, 2025

Data preparation involves splitting the data into training and validation sets and creating the necessary data structures for loading data into the model. The prepare_data() function can directly read the training samples in one of the formats specified above and automates the entire process.
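
For readers unfamiliar with what that automation covers, the splitting step can be sketched in plain Python. This is only an illustration under assumptions: `split_train_val`, the validation fraction, and the fixed seed are hypothetical, and it is not how `arcgis.learn` performs the split internally.

```python
import random


def split_train_val(samples, val_fraction=0.1, seed=42):
    """Shuffle labelled samples and split them into training and validation lists.

    Hypothetical helper for illustration only; not part of arcgis.learn.
    """
    if not 0.0 < val_fraction < 1.0:
        raise ValueError("val_fraction must be between 0 and 1 (exclusive)")
    shuffled = list(samples)  # copy so the caller's list is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split
    n_val = max(1, round(len(shuffled) * val_fraction))  # keep at least one validation sample
    return shuffled[n_val:], shuffled[:n_val]


# Made-up placeholder documents standing in for labelled incident reports.
reports = [f"incident report {i}" for i in range(20)]
train, val = split_train_val(reports, val_fraction=0.2)
print(len(train), len(val))
```

Seeding the shuffle keeps the split stable between runs, which makes validation scores comparable across experiments.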

Collaborator

BP-Ent Jan 28, 2025

The EntityRecognizer model in arcgis.learn can be used with Hugging Face Transformers or with large language model backbones. For this sample use case, we will use the Mistral model backbone to extract entities from the text.

Run the command below to see what backbones are supported for the entity recognition task.

Collaborator

BP-Ent Jan 28, 2025

First, we will create the model using the EntityRecognizer() constructor, passing the following parameters:

data: The databunch created using the prepare_textdata function.

backbone: To use Mistral as the model backbone, use backbone="mistral".

prompt: Text string describing the task and its guardrails. This is an optional parameter.
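
The three parameters above could be assembled as follows. This is a sketch, not the notebook's code: `build_recognizer_kwargs` and the example prompt text are hypothetical, and only the parameter names `data`, `backbone`, and `prompt` come from the description above. The resulting dictionary would be passed on as `EntityRecognizer(**kwargs)`; that call is left out so the sketch runs without `arcgis` installed.

```python
def build_recognizer_kwargs(data, prompt=None):
    """Collect keyword arguments for EntityRecognizer() as described above.

    Hypothetical helper; `data` stands in for the databunch returned by
    the data-preparation step.
    """
    kwargs = {"data": data, "backbone": "mistral"}
    if prompt is not None:  # prompt is optional, so only include it when given
        kwargs["prompt"] = prompt
    return kwargs


# Placeholder object instead of a real databunch (assumption for illustration).
example_data = object()
# Made-up prompt text describing the task and one guardrail.
example_prompt = (
    "Extract the entities from the fire incident report. "
    "Return only text that appears in the report."
)
kwargs = build_recognizer_kwargs(example_data, prompt=example_prompt)
print(sorted(kwargs))
```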

Collaborator

BP-Ent Jan 28, 2025

Important metrics to look at while measuring the performance of the EntityRecognizer model are precision, recall, and F1-measure.
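
For reference, these are the standard definitions in terms of true positives (TP), false positives (FP), and false negatives (FN). How the library decides that a predicted entity counts as a match (for example, exact versus partial span) is not stated here, so treat the counts as an assumption.

```latex
\[
\text{Precision} = \frac{TP}{TP + FP}, \qquad
\text{Recall} = \frac{TP}{TP + FN}, \qquad
F_1 = \frac{2 \cdot \text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}
\]
```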

Collaborator

BP-Ent Jan 28, 2025

To find the precision, recall, and F1 scores per label/class, we will call the model's metrics_per_label() method.
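
To make the per-label breakdown concrete, here is a minimal sketch that computes the three scores for each label from gold and predicted (label, text) pairs. It assumes exact-match scoring, a hypothetical `per_label_scores` helper, and made-up example data; it does not reproduce the internals or the output format of `metrics_per_label()`.

```python
from collections import Counter


def per_label_scores(gold, predicted):
    """Return {label: (precision, recall, f1)} using exact (label, text) matches."""
    gold_counts, pred_counts = Counter(gold), Counter(predicted)
    scores = {}
    for label in sorted({lab for lab, _ in gold} | {lab for lab, _ in predicted}):
        # True positives: gold items of this label that were also predicted.
        tp = sum(min(n, pred_counts[item]) for item, n in gold_counts.items() if item[0] == label)
        n_pred = sum(n for item, n in pred_counts.items() if item[0] == label)
        n_gold = sum(n for item, n in gold_counts.items() if item[0] == label)
        precision = tp / n_pred if n_pred else 0.0
        recall = tp / n_gold if n_gold else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        scores[label] = (precision, recall, f1)
    return scores


# Made-up labels and entity texts, for illustration only.
gold = [("Address", "Main Street"), ("Address", "High Road"), ("Date", "17 January")]
predicted = [("Address", "Main Street"), ("Date", "17 January"), ("Date", "Monday")]
for label, (p, r, f1) in per_label_scores(gold, predicted).items():
    print(f"{label}: precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
```

A per-label view like this shows when a model is precise on one entity type but misses many instances of another, which an aggregate score would hide.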

Collaborator

BP-Ent Jan 28, 2025

Now that we have the trained model, let's look at how the model performs.

Collaborator

BP-Ent Jan 28, 2025

The load() method takes the path to the .emd file as a required argument.
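
Before calling load(), it can help to confirm that the path really points at a model definition. The sketch below is an assumption-laden illustration: `read_emd` is a hypothetical helper, and it assumes the .emd (Esri Model Definition) file is JSON-formatted. It does not load the model itself.

```python
import json
from pathlib import Path


def read_emd(emd_path):
    """Validate an .emd path and return its parsed contents.

    Hypothetical helper; assumes the .emd file is JSON-formatted.
    """
    path = Path(emd_path)
    if path.suffix.lower() != ".emd":
        raise ValueError(f"expected a .emd file, got: {path.name}")
    if not path.is_file():
        raise FileNotFoundError(path)
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)
```

Checking the extension and existence up front gives a clearer error than a failure deep inside model loading.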

Collaborator

BP-Ent Jan 28, 2025

Now we can use the trained model to extract entities from new text documents using the extract_entities() method. This method expects either the path to the folder containing the new text documents or a list of text documents.
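
The two accepted input forms can be illustrated with a small helper that turns either one into a list of strings, for example to inspect the documents before passing them on. This is a sketch under assumptions: `collect_documents` is hypothetical, it assumes documents in a folder are UTF-8 `.txt` files, and it says nothing about how extract_entities() reads its input internally.

```python
from pathlib import Path


def collect_documents(source):
    """Return a list of document strings from a folder path or a list of texts.

    Hypothetical pre-processing helper; assumes documents in a folder are
    UTF-8 .txt files.
    """
    if isinstance(source, (str, Path)):
        folder = Path(source)
        if not folder.is_dir():
            raise NotADirectoryError(folder)
        # Sort so the document order is stable between runs.
        return [p.read_text(encoding="utf-8") for p in sorted(folder.glob("*.txt"))]
    return [str(text) for text in source]


# Made-up example texts passed as a list.
docs = collect_documents(["Fire reported at a barn.", "Crews attended a car fire."])
print(len(docs))
```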

BP-Ent previously requested changes

Collaborator

BP-Ent left a comment

Suggestions made on ReviewNB.

priyankatuteja dismissed BP-Ent’s stale review

January 31, 2025 04:42

done

priyankatuteja merged commit c4959d0 into next

2 checks passed
