
Philip May

  • Machine Learning
  • Python
  • IT
  • Linux
  • Blog
  • About Me

Posted in 2024

09 August 2024 - The selection of topic-specific texts from Wikipedia

02 July 2024 - Pandas Data Format and Compression

11 April 2024 - The importance of chat templates

Posted in 2023

18 November 2023 - Pandas apply

Posted in 2022

12 October 2022 - Options for Date Encoding

23 July 2022 - Python Installation and Package Management with conda and pip

23 February 2022 - Anomalies in the MLSUM Dataset

22 February 2022 - Clean German Wikipedia Text Corpus released

20 February 2022 - LightGBM with Optuna: Demo released

Posted in 2021

10 April 2021 - German colossal, cleaned Common Crawl corpus (GC4) released

Posted in 2020

01 December 2020 - Training and Evaluation of our German Electra Language Model Talk

© Copyright 2020-2024 Philip May.