* DEPR: Change boxplot return_type kwarg
Part of https://github.com/pydata/pandas/issues/6581
Deprecation started in https://github.com/pydata/pandas/pull/7096
Changes the default value of `return_type` in DataFrame.boxplot
and DataFrame.plot.box from None to 'axes'.
* API: Change faceted boxplot return_type
Aligns behavior of `Groupby.boxplot` and DataFrame.boxplot(by=.)
to return a Series.
closes#13107
Author: sinhrks <sinhrks@gmail.com>
Closes#13986 from sinhrks/dti_equals and squashes the following commits:
580151a [sinhrks] BUG/CLN: move .equals to DatetimeOpsMixin
closes#8725 Changes `get_dummies` return columns with `uint8`
dtypes instead of coercing to floats if they were alongside other
float columns.
Author: Tom Augspurger <tom.augspurger88@gmail.com>
Closes#13796 from TomAugspurger/get_dummies_dtype and squashes the following commits:
cace0f7 [Tom Augspurger] BUG: int dtype for get_dummies
Title is self-explanatory. Closes#6618.
Author: gfyoung <gfyoung17@gmail.com>
Closes#14132 from gfyoung/to-csv-newline and squashes the following commits:
d1a600f [gfyoung] BUG: Don't print stray newline with MultiIndex
For a nonexistent file, raise the more specific
FileNotFoundError for Python >= 3.3 in read_csv,
read_table, and read_hdf.
This error is backported to Python 2.x as IOError.
Closes gh-14086.
* TST: assert indexing with list of Periods works (GH7710)
* TST: assert median for timedelta with NaT works (GH8617)
* TST: assert no conversion to float when resampling datetime64 values (13119)
The system file encoding can cause a separator to be encoded as more
than one character even though it maybe provided as one character.
Multi-char separators are not supported by the C engine, so we need to
catch this case. Closes#14065.
Author: gfyoung <gfyoung17@gmail.com>
Closes#14120 from gfyoung/multi-char-encoded and squashes the following commits:
152b685 [gfyoung] API: Warn or raise for > 1 char encoded sep
When a thousands parameter is specified, if the index column data
contains that thousands value for date purposes (e.g. '.'), do not
interpret those characters as the thousands parameter.
Closes gh-14066.
closes#14021
Author: John Liekezer <cool.Bakov@yandex.ru>
Closes#14051 from conquistador1492/issue_14021 and squashes the following commits:
a405777 [John Liekezer] BUG: Dataframe.fillna with np.nan for dtype=category(GH 14021)
closes#13509
Author: Ben Kandel <ben.kandel@gmail.com>
Closes#14092 from bkandel/fix-floatindex-tuple and squashes the following commits:
2b77554 [Ben Kandel] Fix bug in Float64Index.get_value() for tuples.
closes#13977
Author: Kernc <kerncece@gmail.com>
Closes#13978 from kernc/iterrows-with-constructor-sliced and squashes the following commits:
9aaac80 [Kernc] BUG: yield correct Series subclass in df.iterrows() (#13977)
f8f4230 [Kernc] DOC: tm.assert_series_equal() fix docstring default values
- series comparison operator to check whether labels are identical (currently: ignores labels)
- series boolean operator to align with labels (currently: only keeps left index)
split from #13941 (comment)
Author: sinhrks <sinhrks@gmail.com>
Closes#13988 from sinhrks/period_values and squashes the following commits:
d7637c9 [sinhrks] API: PeriodIndex.values now return array of Period objects
Update documentation to state that scalars are accepted for
na_values. In addition, accept scalars for the values when a
dictionary is passed in for na_values.
Closes gh-12224.
closes#13109closes#13135
The precise strategy to be taken here is open for
discussion. I tried to be reasonably fine-grained rather than slap a
generic decorator over everything because it's easier to go that
direction than the reverse. The `errstate()` blocks in the tests were
added *after* fixing all of the library code. Unfortunately, these are
less fine-grained than I would like because some of the tests have
many lines of the form `assert_array_equal(pandas_expression_to_test,
expected_raw_numpy_expression)` where `expected_raw_numpy_expression`
is what is triggering the warning. It was tedious to try to rewrite
all of that to wrap just `expected_raw_numpy_expression`. I think I
got everything exercised by the test suite except for parts of the
test suite that are skipped on my machine due to dependencies. We'll
see how things go in the CI. I haven't added any new tests yet.
Could do if requested.
Author: Robert Kern <rkern@enthought.com>
Author: Robert Kern <robert.kern@gmail.com>
Closes#13145 from rkern/fix/errstate and squashes the following commits:
ef9c001 [Robert Kern] BUG: whoops, wrong function.
7fd2e86 [Robert Kern] ENH: More whatsnew documentation.
44805db [Robert Kern] ENH: Rearrange expression to avoid generating a warning that would need to be silenced.
1fe1bc2 [Robert Kern] pep8
bf1f662 [Robert Kern] BUG: New fixes after master rebase.
e7adc03 [Robert Kern] BUG: wrong function.
a59cfa7 [Robert Kern] ENH: Avoiding the bounds error is better than silencing the warning.
0e1ea81 [Robert Kern] BUG: A few more stragglers.
863ac93 [Robert Kern] TST: Add a new test to ensure that boolean comparisons are errstate-protected.
6932851 [Robert Kern] TST: Basic check that the global errstate remains unchanged.
c9df7b3 [Robert Kern] BUG: removed debugging print
3b12f08 [Robert Kern] ENH: Silence numpy warnings from certain expressions computed during tests.
eca512c [Robert Kern] BUG: Handle NaT explicitly.
6fbc9ce [Robert Kern] BUG: First pass at fine-grained errstate.
Because #13985 has been fixed, ``SparseSeries`` comparison op now can return
``SparseSeries`` (it returns normal ``Series`` on current master).
also fixed a bug when ``SparseArray`` created from ``SparseSeries``
may not inherit ``dtype``.
Author: sinhrks <sinhrks@gmail.com>
Closes#13999 from sinhrks/sparse_comparison and squashes the following commits:
eafc94c [sinhrks] API: SparseSeries comparison now returns sparse
closes#13967
Author: OXPHOS <engelzora@gmail.com>
Closes#14030 from OXPHOS/excel_na and squashes the following commits:
ff1964b [OXPHOS] GH13967: move around _NA_VALUES and add doc for read_csv().na_values