Tags: Bears-R-Us/arkouda
Closes #3339: Add multi-batch parquet read tests (#3350) * Closes #3339: Add multi-batch parquet read tests This PR (closes #3339) adds testing for parquet reads of arrays and strings large enough to trigger more than one batch. We also add testing of a segarray of segstrings containing empty segs and empty strings. * add proto --------- Co-authored-by: Tess Hayes <stress-tess@users.noreply.github.com>
closes #2927 power divergence statistic (#2932) * closes #2927 power divergence statistic * add scipy to requirements * add arkouda/akstats/_stats_py.pyi * Fix F403 and F401 error codes on flake8 arkouda from arkouda/akmath/__init__.py and arkouda/akstats/__init__.py * un-pin scipy from specific version * add scipy license and minor changes in response to code review * Update tests/akmath/akmath_test.py --------- Co-authored-by: Amanda Potts <ajpotts@users.noreply.github.com> Co-authored-by: pierce <48131946+pierce314159@users.noreply.github.com>
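For readers unfamiliar with the statistic named in this commit, the following is a minimal pure-Python sketch of the standard Cressie-Read power divergence formula. It is not Arkouda's implementation; the function name `power_divergence` and the parameter name `lambda_` are chosen here for illustration, and all counts are assumed to be positive.

```python
import math


def power_divergence(observed, expected=None, lambda_=1.0):
    """Cressie-Read power divergence statistic (illustrative sketch).

    lambda_ = 1 gives Pearson's chi-squared statistic and
    lambda_ = 0 gives the log-likelihood ratio (G-test) statistic.
    """
    n = len(observed)
    if expected is None:
        # Assumption for this sketch: default to equal expected frequencies.
        expected = [sum(observed) / n] * n

    pairs = list(zip(observed, expected))
    if lambda_ == 0:
        # Limit as lambda_ -> 0: 2 * sum(O * ln(O / E)).
        return 2.0 * sum(o * math.log(o / e) for o, e in pairs if o > 0)
    if lambda_ == -1:
        # Limit as lambda_ -> -1: 2 * sum(E * ln(E / O)).
        return 2.0 * sum(e * math.log(e / o) for o, e in pairs)

    total = sum(o * ((o / e) ** lambda_ - 1.0) for o, e in pairs)
    return 2.0 * total / (lambda_ * (lambda_ + 1.0))
```

With `lambda_=1` and equal totals for observed and expected counts, the result coincides with the familiar sum of `(O - E)**2 / E`.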
Closes #2838: Expand dataframe merge functions to accept multiple columns (#2848) This PR (closes #2838) expands the dataframe merge functions to act on multiple columns. When no value is provided for `on`, it defaults to the intersection of the columns of the left and right dataframes. `inner_join_merge` and `right_join_merge` were turned into helper functions that aren't exposed to the user, to more closely match the pandas merge functionality, where these are only available through `merge`. Co-authored-by: Pierce Hayes <pierce314159@users.noreply.github.com>
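The default-`on` behaviour described in this commit can be sketched in plain Python. This is an illustration only, not Arkouda's code: the function `merge_inner` and the dict-of-lists table layout are hypothetical, and suffix handling for overlapping non-key columns is deliberately left out.

```python
from collections import defaultdict


def merge_inner(left, right, on=None):
    """Inner-join two column-oriented tables (dicts of equal-length lists)."""
    if on is None:
        # Default described in the commit: columns common to both frames.
        on = [col for col in left if col in right]
    elif isinstance(on, str):
        on = [on]
    if not on:
        raise ValueError("no common columns to merge on")

    left_rest = [c for c in left if c not in on]
    right_rest = [c for c in right if c not in on]
    if set(left_rest) & set(right_rest):
        raise ValueError("overlapping non-key columns are not handled in this sketch")

    # Index the right-hand rows by their composite (multi-column) key.
    index = defaultdict(list)
    for j in range(len(right[on[0]])):
        index[tuple(right[c][j] for c in on)].append(j)

    out = {c: [] for c in on + left_rest + right_rest}
    for i in range(len(left[on[0]])):
        key = tuple(left[c][i] for c in on)
        for j in index.get(key, []):
            for c, value in zip(on, key):
                out[c].append(value)
            for c in left_rest:
                out[c].append(left[c][i])
            for c in right_rest:
                out[c].append(right[c][j])
    return out
```

Calling `merge_inner(left, right)` without `on` joins on every shared column, so a row is kept only when all of those columns match.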
Closes #2716: Add dataframe merge functionality (#2781) * add merge functionality * moving functionality to dataframe.py * remove numeric import * change exception error to TypeError * int col float behavior * remove extraneous code and fix type errors * change the float cast from np to ak * Update arkouda/dataframe.py Co-authored-by: pierce <48131946+pierce314159@users.noreply.github.com> * Update arkouda/dataframe.py Co-authored-by: pierce <48131946+pierce314159@users.noreply.github.com> * Update arkouda/dataframe.py Co-authored-by: pierce <48131946+pierce314159@users.noreply.github.com> * address some of Pierce's comments * identical column suffixes * added df.merge functions * bug fix for the right_join_merge method * add merge test for dataframe * temp test fix, order is wonky but not wrong * Update arkouda/dataframe.py --------- Co-authored-by: Eddie <eddie@MacBook-Air.local> Co-authored-by: pierce <48131946+pierce314159@users.noreply.github.com> Co-authored-by: Pierce Hayes <pierce314159@users.noreply.github.com>
Fixes #2703: Sort bug with `nan`s (#2755) * Fixes #2703: Sort bug with `nan`s This PR fixes #2703. When a `nan` is present in `a`, the value of `min reduce a` will equal `nan`, so `signbit(min reduce a)` will be false even if there are negatives present. This was causing the sort to mishandle `0.0`. I updated the code to do the same thing it used to if `min reduce a` is not a `nan`; when it is, the code finds the signbits of all values and checks whether any are true (i.e. `| reduce signbit(a)`). I feel like calling `signbit` on every value of `a` and then reducing shouldn't be too much more expensive than reducing first and doing only one `signbit` call. But I know the sort code is super optimized, so I'd appreciate it if ronawho could look this over and make sure I'm not doing something dumb that will kill the performance. * updated in response to PR feedback --------- Co-authored-by: Pierce Hayes <pierce314159@users.noreply.github.com>
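The logic described in this commit can be mirrored in a small pure-Python sketch. The actual fix lives in Arkouda's server-side sort code; the helper names below (`signbit`, `min_reduce`, `has_negatives_old`, `has_negatives_fixed`) are hypothetical, and `min_reduce` is written to propagate `nan` as the commit message says `min reduce a` does.

```python
import math


def signbit(x):
    """True when the sign bit of x is set."""
    return math.copysign(1.0, x) < 0


def min_reduce(values):
    """Minimum that yields nan if any element is nan (assumed behaviour)."""
    result = math.inf
    for v in values:
        if math.isnan(v):
            return math.nan
        result = min(result, v)
    return result


def has_negatives_old(values):
    # Old check: one signbit call on the reduced minimum.
    # A nan minimum hides any negative values that are present.
    return signbit(min_reduce(values))


def has_negatives_fixed(values):
    m = min_reduce(values)
    if not math.isnan(m):
        # Same path as before when no nan is involved.
        return signbit(m)
    # nan present: take signbit of every value and or-reduce the results.
    return any(signbit(v) for v in values)
```

For an input such as `[3.0, -1.5, math.nan, 0.0]`, the old check misses the negative value, while the fixed check falls back to the per-element pass and detects it.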