feat(learning classifier) | syntethic data generator by ammirsm · Pull Request #24 · zenbase-ai/core · GitHub

feat(learning classifier) | syntethic data generator #24

Conversation

ammirsm
Contributor
@ammirsm ammirsm commented Jul 25, 2024

No description provided.

ammirsm added 3 commits July 24, 2024 18:05
This commit message succinctly describes the main addition in the diff, which is a new feature for generating synthetic data for single class classifiers.
@ammirsm ammirsm merged commit c947d66 into amir/eng-32-featlearning-classifier-make-a-learning-classifier-by-itself Jul 25, 2024
ammirsm added a commit that referenced this pull request Jul 25, 2024
* add: single class classifier generator.

* add: adaptors.

* add: predefined classifier.

* add: tests.

* add: cookbook.

* update: readme.

* Add news dataset fixture and single class classifier tests

- Implement `news_dataset` fixture to load the 20 Newsgroups dataset.
- Create tests for `SingleClassClassifierLMFunctionGenerator`, including initialization and prediction verification.
- Ensure balanced dataset creation for training, validation, and test sets in `SingleClassClassifier`.
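
The balanced split mentioned in the last bullet could look roughly like the sketch below. It is an illustration only: the function name `balanced_split`, the `label` key, and the per-class counts are assumptions made here, not code from this PR.

```python
import random
from collections import defaultdict


def balanced_split(examples, label_key="label", per_class=(6, 2, 2), seed=0):
    """Split examples into (train, val, test) with equal counts per class.

    Hypothetical helper: `per_class` gives how many items of each label go
    into the train, validation, and test sets respectively.
    """
    n_train, n_val, n_test = per_class
    needed = n_train + n_val + n_test

    # Group examples by their label.
    by_label = defaultdict(list)
    for example in examples:
        by_label[example[label_key]].append(example)

    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    train, val, test = [], [], []
    for label in sorted(by_label):
        items = by_label[label]
        if len(items) < needed:
            raise ValueError(f"label {label!r} has {len(items)} items, need {needed}")
        # Sample without replacement so the three sets never overlap.
        picked = rng.sample(items, needed)
        train.extend(picked[:n_train])
        val.extend(picked[n_train:n_train + n_val])
        test.extend(picked[n_train + n_val:])
    return train, val, test
```

Sampling the same number of items from every label keeps any one class from dominating a split, which matters when the source data (such as a newsgroup corpus) has uneven class sizes.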

* Add retry logic to classifier function in SingleClassClassifierLMFunctionGenerator

Implement exponential backoff with logging for the classifier function to improve resilience during retries.
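
As a rough illustration of this pattern, the sketch below wraps a function in a retry loop with exponential backoff and logging, using only the standard library. The decorator name `retry_with_backoff` and its parameters are assumptions for this example; the PR may well rely on a retry library instead.

```python
import functools
import logging
import time

logger = logging.getLogger(__name__)


def retry_with_backoff(max_attempts=3, base_delay=1.0, factor=2.0, sleep=time.sleep):
    """Retry the wrapped function, multiplying the wait by `factor` each time.

    Hypothetical helper: `sleep` is injectable so tests can avoid real waiting.
    """
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            delay = base_delay
            for attempt in range(1, max_attempts + 1):
                try:
                    return fn(*args, **kwargs)
                except Exception as exc:
                    if attempt == max_attempts:
                        # Out of attempts: log and re-raise the last error.
                        logger.error("attempt %d failed, giving up: %s", attempt, exc)
                        raise
                    logger.warning(
                        "attempt %d failed (%s), retrying in %.1fs", attempt, exc, delay
                    )
                    sleep(delay)
                    delay *= factor
        return wrapper
    return decorator
```

With the defaults above, a function that fails twice and then succeeds waits 1.0 s and then 2.0 s before the third call returns.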

* Refactor SingleClassClassifier attributes and update test cases

- Change type hints from `Optional` to union types for clarity.
- Modify the `_create_evaluator` method to be static.
- Enhance test assertions to validate the result object and its properties.

* Add pytest mark for helpers in single class classifier test

* feat(learning classifier) | syntethic data generator (#24)

* Downgrade Faker to 24.2.0 and update lock files

* Add single class classifier synthetic data generator

This commit message succinctly describes the main addition in the diff, which is a new feature for generating synthetic data for single class classifiers.

* Add instructor package and create synthetic data generator notebook

* Convert dataset to LMDemo objects for dict and synthetic data types

* Add 'datasets' to required packages in single_class_classifier notebook

* Add single class classifier notebook with synthetic data

* Add single class classifier notebook with synthetic data

* Add environment setup and Zenbase import

* Remove parea-ai package from installation list

* Bump version to 0.0.6

* Update predefined prompts cookbooks section in README
@ammirsm ammirsm deleted the amir/eng-37-featlearning-classifier-syntethic-data-generator branch July 28, 2024 19:45