
Mistral Instruct 7B Finetuning on MedMCQA Dataset

Finetuning Mistral Instruct 7B on Google Colab using QLoRA

MistralAI’s Mistral Instruct 7B is one of the most popular open-source Large Language Models (LLMs). It has achieved SOTA performance on many benchmarks compared to other 7B models. In this post, I’ll walk through the steps required to build an LLM that can solve medical entrance exam questions. We’ll finetune Mistral Instruct 7B on the MedMCQA dataset and compare the finetuned model against the original baseline.

Image generated using DALL-E

MedMCQA is a large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions. It has more than 194k high-quality and diverse AIIMS & NEET PG entrance exam MCQs covering around 2.4k healthcare topics and 21 medical subjects. More information about the dataset is available in its GitHub repository: github.com/medmcqa/medmcqa.
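
As a quick illustration (my addition, not code from the original post), the dataset can be pulled directly from the HuggingFace Hub; each record carries the question, the four options, and the index of the correct option.

# Minimal sketch: load MedMCQA from the HuggingFace Hub and inspect one record.
from datasets import load_dataset

dataset = load_dataset("medmcqa")  # splits: train / validation / test
sample = dataset["train"][0]

# Each record has the question text, four options (opa-opd), the index of the
# correct option (cop, 0-3), an explanation (exp) and subject/topic metadata.
print(sample["question"])
print([sample[key] for key in ("opa", "opb", "opc", "opd")])
print("Correct option index:", sample["cop"])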

Due to GPU and memory constraints on Google Colab, we’ll use a GPTQ (post-training quantized) version of Mistral Instruct 7B; TheBloke has published GPTQ-quantized Mistral Instruct 7B checkpoints on the HuggingFace Hub. We will then use the parameter-efficient LoRA technique to finetune the model, which keeps memory consumption in check.
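
The setup roughly looks like the sketch below: load the GPTQ checkpoint with transformers (auto-gptq and optimum handle the quantized weights) and wrap it with a LoRA adapter via peft. The model id, target modules, and LoRA hyperparameters here are illustrative assumptions, not the exact values used in the article.

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "TheBloke/Mistral-7B-Instruct-v0.1-GPTQ"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

model = prepare_model_for_kbit_training(model)  # make the quantized model ready for adapter training
lora_config = LoraConfig(
    r=16,                                 # adapter rank (illustrative)
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # illustrative: attention projections only
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small LoRA adapter is trainable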

If you are unfamiliar with Mistral 7B or with terms like GPTQ or LoRA, I suggest you go through the following article —

You can refer to this article for an in-depth understanding of the concepts behind LLMs. It contains a curated list of important papers and quality articles on LLMs published online.

Let’s first install the following libraries.

!pip install -q accelerate peft bitsandbytes
!pip install -q git+https://github.com/huggingface/transformers
!pip install -q trl py7zr auto-gptq optimum
from google.colab import drive  # assumed completion of the truncated import; typically used to mount Google Drive
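
With the libraries installed, the next step is to turn each MedMCQA record into an instruction prompt that Mistral Instruct understands. The [INST] ... [/INST] template below is a hedged sketch of one reasonable formatting choice, with the correct option text as the target completion; it is not necessarily the exact template used in the rest of the article.

# Hypothetical prompt formatter: map a MedMCQA record to Mistral's chat format.
def format_example(example):
    options = [example["opa"], example["opb"], example["opc"], example["opd"]]
    labels = ["A", "B", "C", "D"]
    question_block = example["question"] + "\n" + "\n".join(
        f"{label}. {option}" for label, option in zip(labels, options)
    )
    answer = f"{labels[example['cop']]}. {options[example['cop']]}"
    return {"text": f"<s>[INST] Answer the following multiple-choice medical question.\n{question_block} [/INST] {answer}</s>"}

# Usage on the training split loaded earlier:
# train_data = dataset["train"].map(format_example)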

