I'm @HubertR21, also known as Hubert Ruczyński
- I work as a Credit Risk Model Validation Specialist at VeloBank,
- I earned my Bachelor's degree in Data Science at the Faculty of Mathematics and Information Science, Warsaw University of Technology, in 2023,
- I earned my Master's degree in Data Science at the Faculty of Mathematics and Information Science, Warsaw University of Technology, in 2024,
- I'm interested in AutoML, Data Visualization, Credit Risk, Natural Language Processing (NLP), Machine Learning Algorithms, fairness, and eXplainable Artificial Intelligence (XAI),
- My main points of interest are AutoML and Credit Risk,
- For two and a half years I've been working as a researcher at MI^2 DataLab, where I took part in 3 major studies/projects (fairPAN, forester, ATLAS),
- In the summer of 2023, I received a grant from the CyberSummer@WUT-3 initiative to study and improve the forester package,
- In 2024 I finalized my research on data preprocessing for tree-based models and published the results of this study,
- I've been teaching Data Visualization Techniques (DVT) and Exploratory Data Analysis at the Faculty of Mathematics and Information Science for 3 years,
- I'm an R enthusiast and I love developing R packages, both open source (browse fairPAN and forester below) and commercial (at VeloBank: AutoLGD, AutoNMD, AutoPDLT),
- I also enjoy sharing knowledge, so I've participated in 9 Machine Learning conferences (WhyR? Conf, Coseal x 2, MLinPL x 3, Ghost Day, AutoML'23, AutoML'24),
- In 2023 the paper about the forester package, titled 'forester: A Novel Approach to Accessible and Interpretable AutoML for Tree-Based Modeling', was accepted at the AutoML'2023 Conference,
- In 2024 the paper titled 'Big Tech influence over AI research revisited: memetic analysis of attribution of ideas to affiliation' was published in the Journal of Informetrics,
- In 2024 the paper titled 'Do Tree-based Models Need Data Preprocessing?' was published at the AutoML'2024 Conference,
- Another paper regarding the forester project is in progress,
- You can reach me by email at [email protected] or via my LinkedIn profile,
- You can browse all the projects, presentations, papers, certificates, etc. in this repository.
- Highly advanced knowledge of R: Package development, AutoML, Data Visualization, Neural Networks, ML, XAI, Fairness, Reporting, Statistics, Statistical Learning, Interpretability, Shiny Apps, Network Analysis,
- Advanced knowledge of Python: AutoML, ML, Data Visualizations, Data Preparation and Preprocessing, NLP, Neural Networks, Deep Learning, Bioinformatics, Voice Detection and Analysis,
- Basic knowledge of Java including JavaFX,
- Good knowledge of SQL,
- Fluent usage of versioning systems, such as Git,
- Advanced mathematical skills in mathematical analysis, algebra, statistics, probability theory, discrete mathematics, stochastic processes,
- Strong organizational skills and use of appropriate tools: Notion, Slack, MS Teams,
- Experience (3 years) of working in research projects: reading and presenting articles, designing and conducting my own studies, writing research papers, presenting the results at international conferences (presentations, posters, etc.),
- Ability to work in Scrum,
- Good knowledge of database paradigms, and experience working with and designing relational and non-relational databases,
- Good knowledge and experience of working with Business Intelligence (PowerBI), Big Data (Apache projects), and Cloud Computing (AWS) systems,
- Experience of working with remote machines/supercomputers (Eagle, Eden, PLGrid) using the SLURM system,
- Basics of C++, MATLAB, Excel, Access, Word,
- Basic knowledge of Quantum Artificial Intelligence,
- High proficiency in English, around C2 level.
Feel free to browse my projects ⬇️