DOC: Update performance comparison section of io docs #28890

WuraolaOyewusi · 2019-10-10T07:04:29Z

xref python-sprints/pandas-mentoring#163

sync

WuraolaOyewusi · 2019-10-10T07:06:11Z

Hi Marc @datapythonista

doc/source/user_guide/io.rst

jreback · 2019-10-12T17:16:02Z

doc/source/user_guide/io.rst

@@ -5593,164 +5593,166 @@ Given the next test set:

 .. code-block:: python

-   from numpy.random import randn
+    from numpy.random import randn


Can you change this example to use our formatting? That is, don't import like this; rather, use
np.random.randn directly, and np.random.seed.
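For reference, a minimal sketch of the style requested above: call `randn` through the `np` namespace and seed the global generator so the test data is reproducible. The seed value, the size `sz`, and the column names are illustrative assumptions, not the values used in io.rst.

```python
import numpy as np

# Seed the global NumPy RNG so the generated test data is reproducible.
np.random.seed(1234)  # seed value is an assumption chosen for illustration

# Call randn through the np namespace instead of `from numpy.random import randn`.
sz = 1000  # illustrative size, not the size used in the docs
data = {"A": np.random.randn(sz), "B": np.random.randn(sz)}
```

Re-seeding with the same value and drawing again reproduces the same numbers, which keeps the documented timings comparable between runs.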

WuraolaOyewusi · 2019-10-20T19:44:46Z

Hi Marc @datapythonista,
could you kindly check this PR?

datapythonista

Thanks @WuraolaOyewusi

The diff is a bit difficult to check, but looks good. You just fixed the indentation, besides rerunning and addressing the review comments, right?

@jorisvandenbossche if you want to have a look.

jorisvandenbossche · 2019-10-21T07:06:56Z

I restored the indentation for a moment, to have an easier diff (but can undo that in the end, as a 4-space indent is easier to work with)

jorisvandenbossche

In general looks good, just noticed one problem with the hdf compression

jorisvandenbossche · 2019-10-21T07:08:31Z

doc/source/user_guide/io.rst

+    24009288 Oct 10 06:43 test_fixed.hdf
+    24009288 Oct 10 06:43 test_fixed_compress.hdf
+    24458940 Oct 10 06:44 test_table.hdf
+    24458940 Oct 10 06:44 test_table_compress.hdf


I just noticed this here: it seems something went wrong with the compression (as it is exactly the same size as the non-compressed one; and also the timing is not slower). Maybe it did fall back to non-compressed because you didn't have the compression lib installed?
(however, if it does that silently, that feels like a bug to me)
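The size check described in this comment can be scripted. Below is a minimal sketch that uses only the standard library: the helper name `looks_uncompressed` and the file names are hypothetical, and `zlib` stands in for the HDF compression library, so this illustrates the check itself rather than the pandas code path.

```python
import os
import tempfile
import zlib


def looks_uncompressed(plain_path, compressed_path):
    """Return True if the 'compressed' file is not smaller than the plain one."""
    return os.path.getsize(compressed_path) >= os.path.getsize(plain_path)


with tempfile.TemporaryDirectory() as tmp:
    payload = b"0123456789" * 10_000  # highly repetitive, so it compresses well
    plain = os.path.join(tmp, "plain.bin")
    packed = os.path.join(tmp, "packed.bin")
    with open(plain, "wb") as f:
        f.write(payload)
    with open(packed, "wb") as f:
        f.write(zlib.compress(payload))

    # A real compression step produces a smaller file.
    compressed_ok = not looks_uncompressed(plain, packed)
    # Identical sizes, as in the listing above, are flagged as suspicious.
    silent_fallback = looks_uncompressed(plain, plain)
```

Random floating-point data compresses far less than this repetitive payload, but a byte-for-byte identical size is still a strong hint that no compression was applied.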

@jorisvandenbossche I found out that the original code had 3-space indentation. Aligning my update with the previous code was the reason some checks failed; when I changed the indentation to 4 spaces, they passed.

Let me check the notebook again about the compression.

You are right about the compression.

@jorisvandenbossche, @datapythonista

> I just noticed this here: it seems something went wrong with the compression (as it is exactly the same size as the non-compressed one; and also the timing is not slower). Maybe it did fall back to non-compressed because you didn't have the compression lib installed?
> (however, if it does that silently, that feels like a bug to me)

I ran the code again and also tried version 0.25.0; the result is still the same. It seems like a bug.
What can I do to fix it?

Can you open a separate issue, referencing this one, that explains the lack of compression in HDF?

@jorisvandenbossche It's probably worth merging this as is and fixing that in a separate PR, since it's an unrelated change. It looks like a bug in the code, and I guess the fix won't be trivial.

jreback · 2019-10-31T13:12:42Z

lgtm. @jorisvandenbossche

WillAyd · 2019-11-08T16:56:07Z

@WuraolaOyewusi can you fix up the merge conflict? I think we can merge this in once complete


WuraolaOyewusi · 2019-11-08T18:24:23Z

@WillAyd
Done

WillAyd · 2019-11-09T01:00:05Z

Great thanks @WuraolaOyewusi keep em coming

…ndexing-1row-df

* upstream/master: (109 commits)
  stronger typing in libreduction (pandas-dev#29502)
  API: rename labels to codes (pandas-dev#29509)
  CLN: remove unnecessary type checks (pandas-dev#29517)
  implement _BaseGrouper (pandas-dev#29520)
  CLN: F-string formatting in pandas/_libs/*.pyx (pandas-dev#29527)
  Fixed more SS03 errors (pandas-dev#29540)
  consolidate dim checks (pandas-dev#29536)
  REF: separate out _get_cython_func_and_vals (pandas-dev#29537)
  remove unnecessary exception (pandas-dev#29538)
  TST:Add test to check single category col returns series with single row slice (pandas-dev#29521)
  Make color validation more forgiving (pandas-dev#29122)
  DOC: update bottleneck repo and documentation urls (pandas-dev#29516)
  TST: add test for df construction from dict with tuples (pandas-dev#29497)
  add test for pd.melt dtypes preservation (pandas-dev#29510)
  updated DataFrame.equals docstring (pandas-dev#29496)
  Resolved merge conflicts (pandas-dev#29506)
  DOC: Improved pandas/compact/__init__.py (pandas-dev#29507)
  DOC: Update performance comparison section of io docs (pandas-dev#28890)
  TST: add test for df.where() with category dtype (pandas-dev#29454)
  DOC: Fix docs on merging categoricals. (pandas-dev#28185)
  ...

WuraolaOyewusi added 7 commits August 21, 2019 11:34

Merge pull request #1 from pandas-dev/master

4e85c6d

sync

Merge pull request #2 from pandas-dev/master

44df2ee

sync

Merge pull request #3 from pandas-dev/master

b887983

sync

Merge pull request #4 from pandas-dev/master

9554ea6

sync

Merge pull request #5 from pandas-dev/master

fd27a6f

sync

Merge pull request #6 from pandas-dev/master

3425a0a

sync

Update io.rst

e53bce0

WuraolaOyewusi added 12 commits October 10, 2019 08:10

Update io.rst

76ccef3

Update io.rst

9672526

Update io.rst

d2c1e20

Update io.rst

709d571

Update io.rst

ddd39f6

Update io.rst

26b5db1

Update io.rst

cf85f95

Update io.rst

3d71d40

Update io.rst

8c8ed93

Update io.rst

3e62c8f

Update io.rst

1af539c

Update io.rst

ce51d5e

jreback reviewed Oct 11, 2019


jreback added the Docs label Oct 11, 2019

jreback added this to the 1.0 milestone Oct 11, 2019

Update io.rst

524c7e0

jreback reviewed Oct 12, 2019


Update io.rst

2b77c5d

datapythonista approved these changes Oct 20, 2019


restore indentation

2224738

jorisvandenbossche changed the title ~~Update performance comparison section of io docs~~ DOC: Update performance comparison section of io docs Oct 21, 2019

fixup

df377c1

jorisvandenbossche requested changes Oct 21, 2019


WuraolaOyewusi mentioned this pull request Nov 1, 2019

HDF file compression not working #29310

Open

WuraolaOyewusi added 3 commits November 8, 2019 19:06

Update io.rst

e3eba95

Update io.rst

3aa5dea

Merge branch 'master' into Update-Performance-Comparison-section-of-I…

0af75a0

…O-docs

WillAyd merged commit 6498bc1 into pandas-dev:master Nov 9, 2019

Reksbril pushed a commit to Reksbril/pandas that referenced this pull request Nov 18, 2019

DOC: Update performance comparison section of io docs (pandas-dev#28890)

45afadb

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

DOC: Update performance comparison section of io docs (pandas-dev#28890)

c4e5daa

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

DOC: Update performance comparison section of io docs (pandas-dev#28890)

397e4b9

