Skip to content
View maxmelichov's full-sized avatar

Block or report maxmelichov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
maxmelichov/README.md
Machine Learning Banner

Hi, I'm Maxim Melichov πŸ‘‹

Data Scientist & AI Engineer specializing in Speech, NLP, and Computer Vision

MSc in Data Science & Machine Learning @ Reichman University
Building and deploying high-performance models with a focus on efficiency (ONNX/TensorRT) and MLOps.

πŸ† 1st Place - Kan News Hebrew Synthetic Voice Competition
🐦 Top 2% (38/2025) - Kaggle BirdCLEF 2025 Competition


πŸš€ Selected Projects

🎧 MamreVoice β€” State-of-the-Art Hebrew TTS

An end-to-end Text-to-Speech engine designed for expressive prosody, robust zero-shot voice cloning, and production-grade latency. The entire stack is optimized with ONNX & TensorRT for efficient, private deployment.

➑️ View Live Demo & Details

πŸ”‘ Phonikud β€” Phonetics-Aware Hebrew Diacritization

A practical Niqqud (vowelization) model that incorporates phonetic awareness to achieve superior accuracy on modern Hebrew. This work emphasizes sentence-level evaluation and clean, deployable code.

➑️ Read the Paper / Project Page

🩺 RSNA β€” Lumbar Spine Abnormality Detection

A level-aware 2.5D/3D classification pipeline for analyzing lumbar spine MRIs (Sagittal T1/T2, Axial T2). The model uses segmentation-guided cropping and a transformer-based combiner to achieve robust stenosis grading across L1–L5 vertebrae.

Kaggle Competition Entry

🐦 BirdCLEF 2025 β€” Large-Scale Bioacoustics

Developed a highly efficient pipeline for large-scale audio classification. Achieved top 2% (rank 38/2025) with an AUC of 0.902 through disciplined validation, weighted-blend ensembling, and optimized audio feature extraction.

Kaggle Competition Entry

πŸ› οΈ Technical Toolbox

Languages & Core Libraries

Python C MySQL NumPy Pandas Scikit-learn Matplotlib

Machine Learning & Deep Learning

PyTorch TensorFlow Keras TorchAudio

Deployment & MLOps

ONNX TensorRT CUDA Git MLOps

Pinned Loading

  1. thewh1teagle/phonikud thewh1teagle/phonikud Public

    Hebrew grapheme to phoneme (G2P)

    Python 72 8

  2. Zonos-Hebrew Zonos-Hebrew Public

    Python 7 1

  3. Text-To-speech Text-To-speech Public

    Roboshaul

    Python 18 4

  4. Anomaly-Detection Anomaly-Detection Public

    Jupyter Notebook 14 3

  5. RSNA-2024-Lumbar-Spine-Degenerative-Classification RSNA-2024-Lumbar-Spine-Degenerative-Classification Public

    Jupyter Notebook

  6. DiffMamba DiffMamba Public

    Forked from state-spaces/mamba

    DiffMamba SSM architecture

    Python