Skip to content

Commit

Permalink
[ADD] base_import_pdf_simple: New module
Browse files Browse the repository at this point in the history
TT48213
  • Loading branch information
victoralmau committed May 30, 2024
1 parent 4465c6e commit d115a92
Show file tree
Hide file tree
Showing 33 changed files with 3,452 additions and 0 deletions.
190 changes: 190 additions & 0 deletions base_import_pdf_simple/README.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,190 @@
======================
Base Import Pdf Simple
======================

..
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!! This file is generated by oca-gen-addon-readme !!
!! changes will be overwritten. !!
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!! source digest: sha256:04414b052ed70db05b830c912a89562becbfa0e319073ddff9a663e855a0a954
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
.. |badge1| image:: https://img.shields.io/badge/maturity-Beta-yellow.png
:target: https://odoo-community.org/page/development-status
:alt: Beta
.. |badge2| image:: https://img.shields.io/badge/licence-AGPL--3-blue.png
:target: http://www.gnu.org/licenses/agpl-3.0-standalone.html
:alt: License: AGPL-3
.. |badge3| image:: https://img.shields.io/badge/github-OCA%2Fedi-lightgray.png?logo=github
:target: https://github.com/OCA/edi/tree/15.0/base_import_pdf_simple
:alt: OCA/edi
.. |badge4| image:: https://img.shields.io/badge/weblate-Translate%20me-F47D42.png
:target: https://translation.odoo-community.org/projects/edi-15-0/edi-15-0-base_import_pdf_simple
:alt: Translate me on Weblate
.. |badge5| image:: https://img.shields.io/badge/runboat-Try%20me-875A7B.png
:target: https://runboat.odoo-community.org/builds?repo=OCA/edi&target_branch=15.0
:alt: Try me on Runboat

|badge1| |badge2| |badge3| |badge4| |badge5|

This module allows you to import PDF files and generate records based on the data
contained in those PDF files.
It also allows you to define a pattern that indicates how to recognize and extract
the data from the PDF to generate a record.

**Table of contents**

.. contents::
:local:

Configuration
=============

To configure a PDF document template for import, the first thing to do is to have the
document defined with a specific structure.

#. Go to Settings > Technical > Base Import PDF Simple > Templates
#. Create a new template by entering a characteristic name and the model on which
the record will be generated.

Fields to consider completing on template:

- Main Model: model on which the record will be generated. Example: purchase.order
- Child field: One2many field that will create records from selected template.
Example: Order Lines (purchase.order)
- Auto detect pattern: Define a characteristic pattern of the document so that
it recognizes that it corresponds to the template we are creating. Need to use
regular expression. Example: (?<=ESA79935607)[\S\s]*
- Header Items: Complete this field if the template has a header table to extract
information lines. Example: Reference,Quantity,Price
- Company: Set the company that will use the template. If it is empty, template
will apply for all companies set on the environment.

#. Add new lines.

- Related model: When adding new line, the section where to locate the data; "header"
which, as its name indicates, refers to the header of the document and "lines" refers
to the structure of lines or table of the document.
- Field: Map the field to be completed. Example: product
- Pattern: Optional field to complete. Define pattern of the document so that it
recognizes the place to get the field selected on PDF template. Need to use regular
expression. Example: ([0-9]{7}) [0-7]{1}
- Value type:
- Fixed: Select this value, if the field mapped will always have an specific
value and not extract the information from template. In this case Pattern field
must be empty.
- Variable: Select variable to get the information from template. In this case,
Pattern field must be completed.
- For Value type "Variable" will appear extra fields to complete:
- Search value: Indicates the field by which the value obtained in the PDF will
be searched on the system.
- Default value: If the search result is empty for the search value option, you
can set default value to create a record and not getting error message.
- Log distint value?: This option is useful when getting prices in order to
compare prices inside system and prices obtained from PDF. This will create lines
with prices obtained from the system but create log on chatter to see the
differences obtained from PDF.

Check demo data to further information.

Usage
=====

This module allows to upload PDF files in any Odoo model. It processes each of the files
and converts it into a new record.
Technically, the pdf is transformed into text and that text is processed to create the
record.
The module incorporates an option in Favorites Import PDF and Template configuration in
order to recognized any document structure.

Known issues / Roadmap
======================

- Add operator in template lines (= or ilike)
- Add support for selection fields as default value.
- Simplify auto-detection (defining a text only to search the system should search the
corresponding regular expression).
- Allow compatibility with registration process created from email alias (for purchase
order for example).
- Remove error if some file is not auto-detected template, options: boolean (default
option according to system parameter) to omit error for not found files or change
process to 2 steps, auto-detect and show lines (each one with respect to a file) with
template applied (similar to dms_auto_classification).
- Create test_base_import_pdf_simple module with sale, purchase and account dependencies
to leave templates created in runboat and tests more useful for testers.
- Display a more readable error if there is an error in Preview process, example: wrong
pattern. Message: "Please check template defined, some items are not correctly set".
- Add a progress bar (widget=“gauge”) in the import wizard process, useful if we import
for example sales orders with 20 lines and thus know the progress.

Compatibility with csv, xls, etc:

- Separate much of the logic to new module base_import_simple that would contain the logic
of templates, type of files (csv, excel, etc) in the templates and wizard and this module
would depend on the other adding only what relates to PDF.
- The base module should take into account for each template whether each line is a new
record or not, and start line (in case you want to omit any), only page 1 would be imported.
- The preview smart-btton would serve exactly the same purpose.
- In the case of csv and Excel that each record is a line, the document will NOT be attached
to the record.
- If you indicate that each record is a line the column will be the key, otherwise you must
specify to which line each line of the template refers.
- In the case of csv it will try to auto-detect the lines and columns (no need to complicate
delimiters configuration).
- The menu "Import PDF" of the favorite menu would become "Import file", and the allowed file
extensions would be those obtained from a method (it would be extended by other modules that
add other formats such as PDF).
- Add queue_job_base_import_simple module to process everything by queues (example: Excel
with hundreds of lines, each one a record).

Bug Tracker
===========

Bugs are tracked on `GitHub Issues <https://github.com/OCA/edi/issues>`_.
In case of trouble, please check there if your issue has already been reported.
If you spotted it first, help us to smash it by providing a detailed and welcomed
`feedback <https://github.com/OCA/edi/issues/new?body=module:%20base_import_pdf_simple%0Aversion:%2015.0%0A%0A**Steps%20to%20reproduce**%0A-%20...%0A%0A**Current%20behavior**%0A%0A**Expected%20behavior**>`_.

Do not contact contributors directly about support or help with technical issues.

Credits
=======

Authors
~~~~~~~

* Tecnativa

Contributors
~~~~~~~~~~~~

* `Tecnativa <https://www.tecnativa.com>`_:

* Víctor Martínez
* Pedro M. Baeza

Maintainers
~~~~~~~~~~~

This module is maintained by the OCA.

.. image:: https://odoo-community.org/logo.png
:alt: Odoo Community Association
:target: https://odoo-community.org

OCA, or the Odoo Community Association, is a nonprofit organization whose
mission is to support the collaborative development of Odoo features and
promote its widespread use.

.. |maintainer-victoralmau| image:: https://github.com/victoralmau.png?size=40px
:target: https://github.com/victoralmau
:alt: victoralmau

Current `maintainer <https://odoo-community.org/page/maintainer-role>`__:

|maintainer-victoralmau|

This module is part of the `OCA/edi <https://github.com/OCA/edi/tree/15.0/base_import_pdf_simple>`_ project on GitHub.

You are welcome to contribute. To learn how please visit https://odoo-community.org/page/Contribute.
2 changes: 2 additions & 0 deletions base_import_pdf_simple/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
from . import models
from . import wizards
34 changes: 34 additions & 0 deletions base_import_pdf_simple/__manifest__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,34 @@
# Copyright 2024 Tecnativa - Víctor Martínez
# License AGPL-3.0 or later (https://www.gnu.org/licenses/agpl).
{
"name": "Base Import Pdf Simple",
"version": "15.0.1.0.0",
"website": "https://github.com/OCA/edi",
"author": "Tecnativa, Odoo Community Association (OCA)",
"license": "AGPL-3",
"depends": ["mail"],
"installable": True,
"data": [
"security/ir.model.access.csv",
"security/security.xml",
"views/base_import_pdf_template_line_views.xml",
"views/base_import_pdf_template_views.xml",
"wizards/wizard_base_import_pdf_preview_views.xml",
"wizards/wizard_base_import_pdf_upload_views.xml",
],
"demo": [
"demo/base_import_pdf_template.xml",
],
"external_dependencies": {
"python": ["pypdf"],
},
"assets": {
"web.assets_backend": [
"base_import_pdf_simple/static/src/**/*.js",
],
"web.assets_qweb": [
"base_import_pdf_simple/static/src/**/*.xml",
],
},
"maintainers": ["victoralmau"],
}
84 changes: 84 additions & 0 deletions base_import_pdf_simple/demo/base_import_pdf_template.xml
Original file line number Diff line number Diff line change
@@ -0,0 +1,84 @@
<?xml version="1.0" encoding="utf-8" ?>
<odoo noupdate="1">
<record
id="demo_base_import_pdf_template_res_partner"
model="base.import.pdf.template"
>
<field name="name">Partner Template</field>
<field name="model_id" ref="base.model_res_partner" />
<field name="child_field_id" ref="base.field_res_partner__child_ids" />
<field name="header_items">Name,Address,Child Country</field>
<field name="auto_detect_pattern">Test partner info.*</field>
</record>
<record
id="demo_base_import_pdf_template_res_partner_header_01"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">header</field>
<field name="field_id" ref="base.field_res_partner__name" />
<field name="pattern">Partner name:[\n] [\n](.*)</field>
</record>
<record
id="demo_base_import_pdf_template_res_partner_header_02"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">header</field>
<field name="field_id" ref="base.field_res_partner__country_id" />
<field name="search_field_id" ref="base.field_res_country__code" />
<field name="pattern">[A-Z].* [(]([A-Z]{1,2})[)][\n]Industry</field>
</record>
<record
id="demo_base_import_pdf_template_res_partner_header_03"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">header</field>
<field name="field_id" ref="base.field_res_partner__industry_id" />
<field name="search_field_id" ref="base.field_res_partner_industry__name" />
<field name="pattern">Industry:[\n] [\n](.*)</field>
</record>
<record
id="demo_base_import_pdf_template_res_partner_header_04"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">header</field>
<field name="field_id" ref="base.field_res_partner__user_id" />
<field name="value_type">fixed</field>
<field name="fixed_value" ref="base.user_admin" />
</record>
<record
id="demo_base_import_pdf_template_res_partner_line_01"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">lines</field>
<field name="field_id" ref="base.field_res_partner__name" />
<field name="column">0</field>
<field name="pattern">(.*),.*,</field>
</record>
<record
id="demo_base_import_pdf_template_res_partner_line_02"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">lines</field>
<field name="field_id" ref="base.field_res_partner__street" />
<field name="column">1</field>
<field name="pattern">.*,(.*),</field>
</record>
<record
id="demo_base_import_pdf_template_res_partner_line_03"
model="base.import.pdf.template.line"
>
<field name="template_id" ref="demo_base_import_pdf_template_res_partner" />
<field name="related_model">lines</field>
<field name="field_id" ref="base.field_res_partner__country_id" />
<field name="search_field_id" ref="base.field_res_country__code" />
<field name="pattern">.*,.*, [A-Z].*[(]([A-Z]{1,2})[)]</field>
<field name="column">2</field>
<field name="log_distinct_value" eval="True" />
</record>
</odoo>
Loading

0 comments on commit d115a92

Please sign in to comment.