Implement Fuzzy Search for Duplicate Issue Detection · Issue #19 · warestack/action-copilot · GitHub

Implement Fuzzy Search for Duplicate Issue Detection #19

Open
dkargatzis opened this issue May 14, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@dkargatzis
Member

Objective

Integrate a fuzzy search algorithm into the GitHub Action to detect and prevent the creation of duplicate issues based on similarities in their titles and descriptions.

Details

  • Algorithm Choice: Use the Levenshtein distance (normalized to a similarity score, since a raw edit distance grows with text length) or the Jaccard index to measure the similarity between the title and description of the new issue and those of existing issues in the repository.
  • Threshold Setting: Define a similarity threshold that determines when a new issue counts as a duplicate. An issue whose similarity to an existing issue surpasses this threshold will not be created; a reference to the existing similar issue will be provided instead.
  • Implementation Considerations:
    • Ensure that the search algorithm does not significantly impact the execution time of the action.
    • Handle edge cases where the issue descriptions or titles are exceptionally long or contain special formatting.
  • Quality Assurance: Include tests to validate the accuracy and efficiency of the fuzzy search implementation. Ensure that the feature does not interfere with the existing functionalities of the GitHub Action.
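
A minimal sketch of the check described above, in Python, assuming issues are plain dictionaries with `number`, `title`, and `body` keys. The helper names, the equal weighting of title and body scores, the default threshold of 0.8, and the `max_chars` cutoff are all illustrative assumptions, not part of the action's actual code.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and keep only alphanumeric word tokens.

    Dropping punctuation also discards most Markdown formatting characters.
    """
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard_similarity(a: str, b: str) -> float:
    """Jaccard index of the two token sets: |A & B| / |A | B|."""
    tokens_a, tokens_b = tokenize(a), tokenize(b)
    if not tokens_a and not tokens_b:
        return 1.0  # two empty texts are treated as identical
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def levenshtein_distance(a: str, b: str) -> int:
    """Edit distance computed row by row, keeping only two rows in memory."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def levenshtein_similarity(a: str, b: str) -> float:
    """Map the edit distance to [0, 1], where 1.0 means identical strings."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein_distance(a, b) / longest


def find_duplicate(new_issue: dict, existing_issues: list[dict],
                   threshold: float = 0.8, max_chars: int = 2000):
    """Return (issue_number, score) for the best match above the threshold.

    Returns None when no existing issue surpasses the threshold.
    The score is a hypothetical equal-weight mix: Levenshtein similarity on
    titles and Jaccard similarity on bodies. Bodies are cut to max_chars so
    exceptionally long descriptions do not dominate the run time.
    """
    def title_of(issue: dict) -> str:
        return (issue.get("title") or "").strip().lower()

    def body_of(issue: dict) -> str:
        return (issue.get("body") or "")[:max_chars]

    best = None
    for issue in existing_issues:
        title_score = levenshtein_similarity(title_of(new_issue), title_of(issue))
        body_score = jaccard_similarity(body_of(new_issue), body_of(issue))
        score = (title_score + body_score) / 2
        if score > threshold and (best is None or score > best[1]):
            best = (issue["number"], score)
    return best
```

Called with a new issue and the list of open issues, `find_duplicate` returns the number and score of the closest match, or `None` when nothing surpasses the threshold. The row-by-row Levenshtein loop is quadratic in the title lengths, which is why this sketch applies it only to titles and uses the cheaper token-set comparison for descriptions.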

Benefits

  • Reduced Redundancy: Minimize duplicate issues, making the issue tracker easier to manage and navigate.
  • Enhanced Efficiency: Save time for developers and maintainers by automatically preventing duplicate issue submissions.
  • Improved Issue Management: Streamline issue management processes by automatically identifying and referencing similar existing issues.

dkargatzis added the enhancement (New feature or request) label on May 14, 2024
@cgoncalves94
Member

We are testing.

warestack-dev bot commented Jul 18, 2024

This is a test comment to verify the commenting functionality.
