mkurman

Popular repositories

  1. mcts-pytorch Public

    A flexible Monte Carlo Tree Search framework with PyTorch for decision-making in language models.

    Jupyter Notebook 8 1

  2. ReasonFlow Public

    ReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models.

    Python 8 4

  3. self_reward_head_pytorch Public

    This repository contains the implementation of a self-reward head designed for language models. The self-reward head enables the model to autonomously score its generated outputs, promoting self-as…

    Python 2

  4. linearmoe_pytorch Public

    This repo contains my custom implementation of a mixture of experts as an extension of the linear layer.

    Python 1

  5. Large-Language-Model-Notebooks-Course Public

    Forked from peremartra/Large-Language-Model-Notebooks-Course

    Practical course about Large Language Models.

    Jupyter Notebook 1

  6. hf_data_generation Public

    This repo contains simple Jupyter-notebook-based generation of machine-learning instruction datasets using open-source Hugging Face models and a Hugging Face Pro subscription.

    Python