merge upstream #2

francislabountyjr · 2024-06-04T04:17:33Z

No description provided.

@dgerlanc

This PR addresses #1335. I have kept `SamplingParameters` in `outlines/generate/api.py` because it's imported in many places and moving it may break examples. Would you want to change that, @dgerlanc? Otherwise, if this looks good, I can make docs changes if needed, let me know.

Hi,

Thank you for this great library!

It seems that the docstrings are not rendered correctly in the docs. I think we should explicitly set the `docstring_style` because [it defaults to `"google"`](https://mkdocstrings.github.io/python/usage/configuration/docstrings/#docstring_style) but outlines is using numpy.

Before:

![Screenshot 2024-12-16 at 23 00 26](https://github.com/user-attachments/assets/c752ee3d-519e-4098-b943-3aab43c8af25)

After:

![Screenshot 2024-12-16 at 23 00 41](https://github.com/user-attachments/assets/5b5f524b-6921-4dbe-994d-72c079e677bc)

There seem to be other issues in the docstrings:

- for example [`Properties`](https://github.com/dottxt-ai/outlines/blob/main/outlines/models/openai.py#L23) should be [`Attributes`](https://numpydoc.readthedocs.io/en/latest/format.html#parameters)
- only openai and transformers models are present in the [api reference](https://github.com/dottxt-ai/outlines/blob/main/docs/api/models.md)

I'm happy to make followup PRs for those. Please let me know if I missed something, I couldn't find related issues/PRs.
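For reference, a rough sketch of what a numpy-style docstring with an `Attributes` section looks like. The class `ModelConfig` and its fields are hypothetical and are not taken from the Outlines code base; the point is only the section layout that a numpy-aware docstring parser expects.

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Configuration for a text-generation model (hypothetical example).

    Attributes
    ----------
    model_name
        Identifier of the model to load.
    temperature
        Sampling temperature; higher values give more varied output.
    """

    model_name: str
    temperature: float = 1.0


# The docstring is plain text on the class; the docs tool decides how to parse it.
config = ModelConfig(model_name="example-model")
```

With the style left at `"google"`, the underlined section headers above are not recognized as sections, which matches the rendering problem described.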

Allow giving custom filters to the prompt decorator:

```python
from outlines import prompt


def reverse(s: str) -> str:
    # Custom filter: return the input string reversed.
    return s[::-1]


@prompt(filters={'reverse': reverse})
def reverse_prompt(text):
    '''{{ text | reverse }}'''


rendered = reverse_prompt("Hello")
print(rendered)  # olleH
```

We have been noticing the following error with a recent version of outlines when used with MLX:

```
TypeError: argument 'token_id': 'float' object cannot be interpreted as an integer

At:
  /.../outlines_core/fsm/guide.py(294): get_next_state
  /.../outlines/processors/structured.py(101): process_logits
  /.../outlines/processors/base_logits_processor.py(90): __call__
```

The issue is that the MLX array of tokens, which are integers, is being force-converted to floats, even though outlines expects an integer array. This is because all MLX arrays are being converted to `float32`, even when that is not appropriate, as in this case.

Looking at the [commented link](https://ml-explore.github.io/mlx/build/html/usage/numpy.html#pytorch), the advice was to convert to `float32` only for `bfloat16`, because numpy does not support `bfloat16`. Now the MLX `_to_torch` implementation matches the other array libraries: none of them force-cast to float.
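A small standard-library sketch of the failure mode and the intended rule. It does not use MLX or torch: `operator.index` stands in, as an assumption, for the integer conversion applied to `token_id`, and `cast_target` is a hypothetical helper that only expresses the dtype rule by name.

```python
import operator

token_ids = [101, 2023, 102]

# A blanket float32 cast effectively turns every token id into a float.
as_floats = [float(t) for t in token_ids]

# Integer ids convert cleanly.
first_id = operator.index(token_ids[0])

# Float ids are rejected, even when they hold whole numbers.
try:
    operator.index(as_floats[0])
    error_message = None
except TypeError as exc:
    error_message = str(exc)


def cast_target(dtype_name: str) -> str:
    """Hypothetical rule: upcast only bfloat16, keep every other dtype."""
    return "float32" if dtype_name == "bfloat16" else dtype_name
```

Under this rule an `int32` token array keeps its dtype, so downstream code that expects integer token ids keeps working.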

rlouf and others added 30 commits April 15, 2024 14:54

Add documentation vLLM for multiple GPUs

f2af45a

Add torch dependency in installation instructions

98dd9dd

Add DirectMerge article

0bb6d1d

Add cookbook to run Outlines on Modal

895a82e

Make torch import optional

f6e5523

Add Outlines twitter account

2182dbc

Exclude escape character in JSON string fields

06d5654

Add tweets about Outlines

8208133

#824 model_name as optional parameter for azure_openai

92d79a1

Force dates, uuid, datetimes, times to be between quotes

d548d92

Remove unnecessary vocabulary copies in RegexGuide

4d6ec1f

Fix typo in llama.cpp documentation

19eb0f0

add azure_openai to models __init__.py

fc5e342

follow isort

078f822

Add phone number and zip code custom types

1eb9a04

Add cookbook to run Outlines with BentoML and BentoCloud

9188eff

Add Gigax project to community

f5c43e0

Add HF evaluation article

4934425

Remove the need to copy all tokens during basic generation

164d1f0

Add HF article to README

d6a2b79

Add ISBN type

ec664fe

Add IATA code type

1053e7d

Add country types

51064f3

Fix typo in docs (#860)

842ef19

Update README.md

a101c1c

Fix code rendering (#864)

b3b29fe

Make the code render as Python.

ignore import warnings from huggingface_hub & pyairports

db83c08

Fix format in the BentoML doc

b7d876e

Signed-off-by: Sherlock113 <[email protected]>

fix CFG Generation

353cebb

Localize types

4f8433d

rlouf and others added 30 commits December 9, 2024 16:02

Convert enum of objects to oneOf with consts

5113f94

The `enum` keyword is reserved for arrays of constants. When we have callables as members of a Python `Enum`, we get an object whose properties are the arguments of the callable. The `Enum` thus needs to be converted into a `oneOf` field.
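A minimal sketch of the idea, not the actual Outlines implementation. The helper `enum_to_one_of`, the `Add` class, and the `Operation` enum are all hypothetical. Constants become `const` entries, and a callable member becomes an object schema whose properties are the callable's argument names (property types are left empty here for brevity).

```python
import inspect
from enum import Enum


class Add:
    """Callable object used as an Enum value.

    A plain function defined in the Enum body would become a method,
    not a member, so this sketch uses a callable instance instead.
    """

    def __call__(self, a: int, b: int) -> int:
        return a + b


class Operation(Enum):
    add = Add()
    noop = "noop"


def enum_to_one_of(enum_cls):
    """Hypothetical helper: build a `oneOf` schema from an Enum class."""
    options = []
    for member in enum_cls:
        value = member.value
        if callable(value):
            # One object schema per callable, keyed by its argument names.
            params = inspect.signature(value).parameters
            options.append(
                {
                    "type": "object",
                    "properties": {name: {} for name in params},
                    "required": list(params),
                }
            )
        else:
            # Plain constants map directly to `const`.
            options.append({"const": value})
    return {"oneOf": options}


schema = enum_to_one_of(Operation)
```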

Fail as soon as one test fails

147b03e

Fix conversion from MLX to torch tensor

fb43f4f

Bump outlines-core to 0.1.24

9df3b1b

Bump outlines-core version to 0.1.25

05bcb4b

Bump outlines-core version to 0.1.26

5c41fb3

Fix image link in receipt-digitization example

365b566

The current link points to an HTML page instead of a PNG, so after it's downloaded via `requests.get`, PIL isn't able to recognize the format, and rightly so.
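One way to catch this kind of problem early is to check the downloaded bytes before handing them to PIL. The sketch below is an illustration only: `looks_like_png` is a hypothetical helper, and the two payloads are made-up stand-ins for what `response.content` might contain.

```python
# Every PNG file starts with this fixed 8-byte signature.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def looks_like_png(payload: bytes) -> bool:
    """Return True if the payload starts with the PNG signature."""
    return payload.startswith(PNG_SIGNATURE)


# Stand-in for a response body that is an HTML page, not an image.
html_payload = b"<!DOCTYPE html><html><body>Not an image</body></html>"

# Stand-in for the first bytes of a real PNG file.
png_payload = PNG_SIGNATURE + b"\x00\x00\x00\rIHDR"

html_is_png = looks_like_png(html_payload)
png_is_png = looks_like_png(png_payload)
```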

Fix unresolved imports in transformers doc

061f2ba

Fix JSON structured import

6205c0b

Update mkdocs.yml: jump to mamba section

fef70c0

Update transformers.md: fix headings level for mkdocs toc

5920870

remove non_blocking=True

fddfc8f

optimize tensor creation

2f0740e

add mps to processor benchmark

6a8612b

Skip vllm-related tests on non-linux platforms

d32dfde

Fix #1356

Fix json_schema imports in cot example

3cc399d

Add from_file class method to the Prompt object

4456f3c

Remove prompt.render method

bf27c77

Apply a small fix suggested by linter: Ruff E721

7149a5e

Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
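A short illustration of the rule, using made-up values. Comparing types with `==` is what the linter flags; `is` checks for the exact type, while `isinstance()` also accepts subclasses.

```python
value = 3

# Flagged by E721: equality comparison between types.
#     if type(value) == int: ...

# Exact type check: identity comparison on the type object.
exact_match = type(value) is int

# Subclass-aware check: bool is a subclass of int, so this is True.
subclass_aware = isinstance(True, int)

# The exact check distinguishes bool from int.
bool_is_exactly_int = type(True) is int
```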

Update react_agent.md with current import structure

e7be562

The old library structure had not been updated to reflect the present one.

Update transformers_vision.md

03d3878

Fix vllm-related pytest warning (that was spaming user)

3af1e5a

Before this commit, running `pytest -k specific_test` printed dozens of copies of the same skipped-test warning to stdout. IMHO, that was not ideal. Bug introduced in d32dfde.

Remove incorrect text from feature matrix docs

a03c254

There's an extra `outlines.generate` row in the feature matrix docs. This removes it. I also modified the markdown syntax for one header to use ** rather than __, consistent with the rest of the table.

Update vllm_integration.py to the latest version of outlines (#1352)

7b9012b

Fix typo in docs/community/examples

8b173a3

Fix readme to reflect routing through enum

063291d
