Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regression label quality scores #572

Merged
merged 127 commits into from
Jun 24, 2023
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
Show all changes
127 commits
Select commit Hold shift + click to select a range
a9282da
added basic regression ranking
krmayankb Oct 11, 2022
0503595
minor fixes, docstring modified
krmayankb Nov 4, 2022
0a0c41e
tutorial added, added to docs index pages
krmayankb Nov 10, 2022
f4a8d17
unit tests added
krmayankb Nov 10, 2022
5aee141
reindexed tutorial, punctuation fix for docstring
krmayankb Nov 10, 2022
03fbc18
plots changed in tutorial notebook
krmayankb Nov 15, 2022
29d6080
typo fix
krmayankb Dec 8, 2022
bf7860e
cleanlab outlier based scoring method added
krmayankb Dec 9, 2022
9bf8a5f
regression_utils created
krmayankb Dec 9, 2022
c399ffc
pred_labels changed to predictions
krmayankb Dec 12, 2022
2519550
unit tests for new scoring method
krmayankb Dec 22, 2022
9d00253
init merge conflict resolved
krmayankb Dec 23, 2022
1a9409f
tutorial draft1
krmayankb Dec 22, 2022
8705398
tutorial draft1
krmayankb Dec 22, 2022
c081913
merge conflict
krmayankb Dec 23, 2022
5a8a22d
Merge branch 'master' into regression
elisno Dec 23, 2022
d80e077
default modified for method in docstring
krmayankb Dec 23, 2022
02defb9
grammatical correction in rank.py
krmayankb Dec 23, 2022
4a0a9ef
Update cleanlab/regression/rank.py
krmayankb Dec 23, 2022
f2c5862
rank.py updates
krmayankb Dec 24, 2022
7d97c26
outre: code updated for name for second method
krmayankb Dec 24, 2022
305d11b
Apply Docstring suggestions from code review
krmayankb Dec 24, 2022
e9468dd
Support for array_like labels and predictions
krmayankb Dec 24, 2022
6dbe3fb
Merge branch 'regression' of https://github.com/krmayankb/cleanlab in…
krmayankb Dec 24, 2022
48c3f57
doctring for method modified
krmayankb Dec 24, 2022
65e1a3c
datapoint -> example
krmayankb Dec 24, 2022
7b15cba
check_valid_inputs update
krmayankb Dec 24, 2022
b9b9104
tutorial removed
krmayankb Dec 27, 2022
c05f1fe
support for array_like
krmayankb Dec 28, 2022
d6ac642
unit tests to factor array_like
krmayankb Dec 28, 2022
2b327c3
Update docs/source/tutorials/index.rst
krmayankb Dec 28, 2022
4283a67
added basic regression ranking
krmayankb Oct 11, 2022
53455bf
minor fixes, docstring modified
krmayankb Nov 4, 2022
987ae0e
tutorial added, added to docs index pages
krmayankb Nov 10, 2022
7f9372b
unit tests added
krmayankb Nov 10, 2022
581c1f0
reindexed tutorial, punctuation fix for docstring
krmayankb Nov 10, 2022
13ab45e
plots changed in tutorial notebook
krmayankb Nov 15, 2022
0eac776
typo fix
krmayankb Dec 8, 2022
1a65c9a
cleanlab outlier based scoring method added
krmayankb Dec 9, 2022
e8a9a49
regression_utils created
krmayankb Dec 9, 2022
98930fc
pred_labels changed to predictions
krmayankb Dec 12, 2022
e4e6307
unit tests for new scoring method
krmayankb Dec 22, 2022
af2454b
init merge conflict resolved
krmayankb Dec 23, 2022
be8afaa
tutorial draft1
krmayankb Dec 22, 2022
ea2f723
tutorial draft1
krmayankb Dec 22, 2022
f9af6eb
merge conflict
krmayankb Dec 23, 2022
00bcf61
default modified for method in docstring
krmayankb Dec 23, 2022
542e30f
grammatical correction in rank.py
krmayankb Dec 23, 2022
3958b58
Update cleanlab/regression/rank.py
krmayankb Dec 23, 2022
db0bb5d
rank.py updates
krmayankb Dec 24, 2022
0ea2981
outre: code updated for name for second method
krmayankb Dec 24, 2022
9ab2092
Support for array_like labels and predictions
krmayankb Dec 24, 2022
c078d67
Apply Docstring suggestions from code review
krmayankb Dec 24, 2022
d1518da
doctring for method modified
krmayankb Dec 24, 2022
a819fe4
datapoint -> example
krmayankb Dec 24, 2022
ac52da7
check_valid_inputs update
krmayankb Dec 24, 2022
8394ee1
tutorial removed
krmayankb Dec 27, 2022
569b2ff
support for array_like
krmayankb Dec 28, 2022
86532b0
unit tests to factor array_like
krmayankb Dec 28, 2022
cb596a9
Update docs/source/tutorials/index.rst
krmayankb Dec 28, 2022
27ccc26
merge master to regression
krmayankb Dec 29, 2022
313faee
Merge branch 'regression' of https://github.com/krmayankb/cleanlab in…
krmayankb Dec 29, 2022
6c16a1a
unused imports removed
krmayankb Dec 29, 2022
f2b5f4e
Merge branch 'cleanlab:master' into regression
krmayankb Dec 29, 2022
54ae993
tutorial added
krmayankb Dec 30, 2022
a25b236
default, frac_neighbors 0.1 -> 0.5
krmayankb Dec 30, 2022
6bf61c6
updated tutorial notebook
krmayankb Dec 30, 2022
15bfa43
review suggestion updated
krmayankb Dec 30, 2022
feb5797
suggestion in test corrected
krmayankb Dec 30, 2022
3a49cfd
copyright updated
krmayankb Dec 30, 2022
cdfa82d
Error message suggestions updated
krmayankb Dec 30, 2022
7a25675
Copyright update
krmayankb Dec 30, 2022
f4571b9
Suggestions from code review
krmayankb Dec 30, 2022
22728d8
black formatting
krmayankb Dec 30, 2022
79a509f
import cell correction
krmayankb Dec 30, 2022
96d4ae0
estimator update to 10
krmayankb Dec 30, 2022
1fe5b37
example in docstring updated
krmayankb Dec 30, 2022
56bc771
chracterization test added
krmayankb Dec 30, 2022
3d71ae1
notebook output cleared
krmayankb Dec 30, 2022
1e078e0
specify better default values
jwmueller Jan 4, 2023
044c5aa
ignore flake8 flagging unused submodule imports
jwmueller Jan 5, 2023
8e83b37
test_all_close commented
krmayankb May 7, 2023
f1755e4
Merge branch 'regression' of https://github.com/krmayankb/cleanlab in…
krmayankb May 7, 2023
6df4974
Merge branch 'master' into regression
krmayankb May 7, 2023
9e840a3
unit test fixed
krmayankb May 7, 2023
7354708
Merge branch 'regression' of https://github.com/krmayankb/cleanlab in…
krmayankb May 7, 2023
147d383
update clipping to use value from constants.py
huiwengoh May 15, 2023
5814b62
Merge branch 'cleanlab:master' into regression
huiwengoh May 16, 2023
4eddcde
v0 of cleanlearning
huiwengoh May 19, 2023
8dc6fd6
add sample weight
huiwengoh May 19, 2023
0095c46
add save_space
huiwengoh May 19, 2023
6f14791
add some type checking + error catching
huiwengoh May 22, 2023
48fd0a0
add unittests
huiwengoh May 22, 2023
b367479
fix typing
huiwengoh May 23, 2023
1c720ff
add docs structure
huiwengoh May 23, 2023
624d639
add docstrings
huiwengoh May 25, 2023
a6381b0
add docs for helper methods
huiwengoh May 25, 2023
d761159
fix mypy
huiwengoh May 25, 2023
7f3008f
add tutorial
huiwengoh Jun 7, 2023
48f83b1
fix mypy
huiwengoh Jun 7, 2023
f851cc2
Merge branch 'master' into regression
huiwengoh Jun 7, 2023
95eb858
Suggestions from review
huiwengoh Jun 8, 2023
7359318
update notebook + misc cleanup
huiwengoh Jun 8, 2023
40955ee
Merge branch 'cleanlab:master' into regression
huiwengoh Jun 8, 2023
a0f4906
add unittests
huiwengoh Jun 8, 2023
c1420a0
update uncertainty functions
huiwengoh Jun 9, 2023
a62dc55
clarify OHE is optional
jwmueller Jun 23, 2023
050628a
clarify block is about your dataset
jwmueller Jun 23, 2023
8f2f580
remove label issue terminology from quickstart
jwmueller Jun 23, 2023
7310dbe
language for cleanlearning description
jwmueller Jun 23, 2023
44bc19d
clarify what label is
jwmueller Jun 23, 2023
7bc1563
clarify label issue
jwmueller Jun 23, 2023
ff10d47
second defining of label issue
jwmueller Jun 23, 2023
b2ab7ce
comma to slash
jwmueller Jun 23, 2023
9bb39ca
erroneous
jwmueller Jun 23, 2023
7eba836
rank module header language
jwmueller Jun 23, 2023
46fb58f
rmv ...
jwmueller Jun 23, 2023
53d73a2
clarify second set of label quality scores
jwmueller Jun 23, 2023
f12153f
typo fix
jwmueller Jun 23, 2023
c04fa09
discuss methods and paper
jwmueller Jun 23, 2023
26e909d
cleanlearning docstring edits
jwmueller Jun 23, 2023
66516d9
duplicated issues
huiwengoh Jun 23, 2023
65c08c0
Merge branch 'master' into regression
huiwengoh Jun 23, 2023
eb514d5
make methods private
huiwengoh Jun 23, 2023
d53b6a6
clear notebook outputs
huiwengoh Jun 23, 2023
3da126d
black formatting
huiwengoh Jun 23, 2023
f138ae8
update docs
huiwengoh Jun 23, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
43 changes: 43 additions & 0 deletions cleanlab/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -10,3 +10,46 @@
from . import token_classification
from . import multilabel_classification
from . import regression


class DatalabUnavailable:
def __init__(self, message):
self.message = message

def __getattr__(self, name):
message = self.message + f" (raised when trying to access {name})"
raise ImportError(message)

def __call__(self, *args, **kwargs):
message = (
self.message + f" (raised when trying to call with args: {args}, kwargs: {kwargs})"
)
raise ImportError(message)


def _datalab_import_factory():
try:
from .datalab.datalab import Datalab as _Datalab

return _Datalab
except ImportError:
return DatalabUnavailable(
"Datalab is not available due to missing dependencies. "
"To install Datalab, run `pip install cleanlab[datalab]`."
)


def _issue_manager_import_factory():
try:
from .datalab.issue_manager import IssueManager as _IssueManager

return _IssueManager
except ImportError:
return DatalabUnavailable(
"IssueManager is not available due to missing dependencies for Datalab. "
"To install Datalab, run `pip install cleanlab[datalab]`."
)


Datalab = _datalab_import_factory()
IssueManager = _issue_manager_import_factory()
3 changes: 3 additions & 0 deletions setup.cfg
Original file line number Diff line number Diff line change
Expand Up @@ -6,3 +6,6 @@ per-file-ignores =
cleanlab/token_classification/__init__.py: F401
cleanlab/benchmarking/__init__.py: F401
cleanlab/regression/__init__.py: F401
cleanlab/datalab/*/__init__.py: F401
cleanlab/models/__init__.py: F401
cleanlab/multilabel_classification/__init__.py: F401