Back to Search AI connectors listGitHub is a widely used platform for version control and collaboration, enabling developers to host, manage, and track changes in code repositories. With the GitHub On-Premise connector in Search AI, you can ingest and index issues, pull requests, files, pages, and commit messages from your self-hosted GitHub instance.The connector supports multiple authentication profiles, allowing you to configure and index content from one or more GitHub organizations simultaneously.
Search AI supports two authentication methods for GitHub On-Prem:
Personal Access Token
OAuth 2.0
Each authentication profile corresponds to a GitHub organization and requires owner or administrator permissions to ensure proper access to repositories and metadata.
DE Region: https://de-bots-idp.kore.ai/workflows/callback
Prod: https://idp.kore.com/workflows/callback
This generates client credentials. Use the device flow and client credentials to manually create an access token using an API client tool such as Postman.
Standard FilterSelect the repositories to ingest content from. All accessible repositories are listed. Select the required repositories and click Add Selection.Advanced FiltersConfigure additional filters using properties specific to each content type. The connector ingests only content that meets both standard and advanced filter criteria.Ingested FieldsFor all content types, the connector captures:
doc_source_type — identifies the content type in the ingested JSON
repository_id and repository_name — repository details
url — link to the specific object
Creation and update timestamps
For Issues, the connector also captures: issue status, comments, reporter, assignee, reactions, closure date, closed by, and labels.
All profiles are synchronized in sequence, most recently added first
Disconnected profile
Previously ingested content is retained until a manual sync or deletion
Deleted profile
All associated content is removed unless already synced through another profile
Each sync performs a full fetch of accessible content from GitHub and ingests only new or updated items into the Search AI index.Conflict HandlingIf two authentication profiles apply different field mappings to the same document, the most recent sync takes precedence.
For all content ingested from GitHub repositories, Search AI sets the repository ID as the sys_racl value. This value is stored as a permission entity. Use the Permission Entity APIs to associate users with the permission entity corresponding to each repository ID.