
MainframeGPT #644

Open
venkatzhub opened this issue Apr 26, 2024 · 8 comments
Labels
1-new-project-wg New Project or Working Group application

Comments

@venkatzhub

Project description

The project entails training and fine-tuning an open-source generative AI large language model (LLM) designed to function as a primary mentor and educator within the mainframe domain. Its primary objectives include:

  • Facilitating the onboarding process for new personnel to the mainframe platform by providing comprehensive guidance on understanding mainframe fundamentals and associated concepts.
  • Serving as a quick reference resource for experienced professionals within the mainframe ecosystem, offering assistance in accessing how-to guides and best practice recommendations.

Value Proposition: The implementation of this initiative offers significant value through the creation of a centralized repository of mainframe knowledge. This resource is instrumental in expediting the onboarding process for new recruits and streamlining the search for pertinent information. Consequently, it serves to enhance overall efficiency and productivity within the mainframe environment.

Statement on alignment with Open Mainframe Project Mission and Vision statements

Aligns with the OMP vision of ensuring the mainframe remains an integral and indispensable part of enterprise IT by democratizing access to mainframe concepts in an easy-to-consume fashion.

Are there similar/related projects out there?

Not that we are aware of. Many organizations are considering building something similar on their own to help the onboarding process; hence, having something driven by the mainframe ecosystem in the open-source world would save a lot of time and resources.

Sponsor from TAC

To be appointed

Proposed Project Stage

Incubation

License and contribution guidelines

License: Apache 2.0

Current or desired source control repository

GitHub

External dependencies (including licenses)

Initial committers

TBD

Infrastructure requests

TBD

Communication channels

Slack, mailing lists, and GitHub Issues

Website

TBD

Release methodology and mechanics

TBD

Social media accounts

TBD

Community size and any existing sponsorship

TBD

@v1gnesh

v1gnesh commented Apr 27, 2024

Hi @venkatzhub, can I reach you on email? Would like to get involved.

@markbsigler

I've found Mistral and Mixtral to be much more expensive for inference than Llama 3. While I'd agree on using an open-source LLM as a foundation, I would suggest evaluating candidates before the final selection.

Fine-tuning could be very effective but can also be costly. Retrieval-Augmented Generation (RAG) could be more cost-effective. This approach keeps the LLM separate from a large knowledge base: when responding to a prompt, the system first retrieves relevant information from the knowledge base and then passes it to the LLM to generate a response. This lets the LLM draw on far more information than it could hold on its own.
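
For illustration, here is a minimal sketch of that retrieve-then-generate flow in Python. It assumes sentence-transformers for embeddings and uses a placeholder `call_llm()` for whichever open-source model the project ultimately picks; the toy documents and helper names are illustrative only, not a committed design:

```python
# Minimal RAG sketch: embed a small knowledge base, retrieve the chunks most
# similar to the question, and hand them to the LLM as context.
import numpy as np
from sentence_transformers import SentenceTransformer  # assumed embedding library

encoder = SentenceTransformer("all-MiniLM-L6-v2")

# Toy knowledge base; in practice this would be chunked mainframe documentation.
docs = [
    "JCL (Job Control Language) describes batch jobs submitted to z/OS.",
    "A started task is a long-running address space started by an operator command.",
    "RACF controls access to datasets and other protected resources.",
]
doc_vecs = encoder.encode(docs, normalize_embeddings=True)

def retrieve(question: str, k: int = 2) -> list[str]:
    """Return the k knowledge-base chunks most similar to the question."""
    q_vec = encoder.encode([question], normalize_embeddings=True)[0]
    scores = doc_vecs @ q_vec          # cosine similarity (vectors are normalized)
    top = np.argsort(scores)[::-1][:k]
    return [docs[i] for i in top]

def call_llm(prompt: str) -> str:
    # Placeholder: plug in the chosen open-source model here
    # (e.g. served via an inference endpoint or loaded locally).
    raise NotImplementedError("wire up the selected LLM")

def answer(question: str) -> str:
    context = "\n".join(retrieve(question))
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    return call_llm(prompt)
```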

It would be great if IBM could contribute RedBooks to the training data.
Interskill would certainly not be thrilled about this project, since mainframe education is their core offering.

@venkatzhub
Author

> Hi @venkatzhub, can I reach you on email? Would like to get involved.

Sure.

@venkatzhub
Author

> I've found Mistral and Mixtral to be much more expensive for inference than Llama 3. While I'd agree on using an open-source LLM as a foundation, I would suggest evaluating candidates before the final selection.
>
> Fine-tuning could be very effective but can also be costly. Retrieval-Augmented Generation (RAG) could be more cost-effective. This approach keeps the LLM separate from a large knowledge base: when responding to a prompt, the system first retrieves relevant information from the knowledge base and then passes it to the LLM to generate a response. This lets the LLM draw on far more information than it could hold on its own.
>
> It would be great if IBM could contribute RedBooks to the training data. Interskill would certainly not be thrilled about this project, since mainframe education is their core offering.

We can definitely look at the different models. We experimented heavily with Mistral and Mixtral and can speak to the quality of output from those models; I expect Llama 3 would perform just as well. The goal is to pick an open-source model that is well governed.

Re: how we implement, RAG has to be in the picture, and that is how we should structure ourselves.

Re: Interskill, I do not think this would replace instructor-led learning; like any other AI tool, this would be an assistant. I can see ways Interskill could benefit from using this model to enhance their offerings.

@venkatzhub
Author

@jmertic : Some names to consider

Mainframe Mentor
Mainframe Mind
Mainframe Insight
Mainframe Assistant

@v1gnesh

v1gnesh commented May 24, 2024

Easy one - wiZard :)
EDIT: Actually, IBM already took the name DocBuddy; that would have been perfect.

@jmertic
Member

jmertic commented May 24, 2024

Thanks - I've asked our trademark counsel to review and provide feedback. Will keep you posted.

@anushkasingh98

Hey @venkatzhub, I would also love to get involved in this project.
