Skip to content
View DrejcPesjak's full-sized avatar

Highlights

  • Pro

Block or report DrejcPesjak

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DrejcPesjak/README.md

Today's AI News

Todays Image

AI Subreddit Recap: Highlights

AI Models and Training:

  • Mistral Small 3: Competitively performs against larger models despite being 3x faster and open-source.
  • DeepSeek R1: Performs well on local rigs without a GPU, offering impressive speed and accessibility.
  • Llama 4: Progress delayed but promised with multimodal capabilities and diverse sizes.

Industry Trends and Market Analysis:

  • Nvidia FP8 Performance Reduction: Half the performance on new RTX 40/50 GPUs, raising concerns about potential limitations.
  • Knowledge Distillation Controversy: Debate over its legality, with some suggesting it's not copyright violation.

Applications and User Experiences:

  • Copilot's Decline: Users express disappointment with recent quality and suggest a strategic shift by Microsoft.
  • ChatGPT Updates: Limited improvements with concerns about excessive emoji usage and uncertainty about future major updates.

Other News:

  • Mark Zuckerberg discusses progress on AI, emphasizing Meta's multimodal capabilities.
  • Reddit discussions explore DeepSeek-R1's impact on research and its integration into Microsoft Azure services.

Pinned Loading

  1. DPhate-double-paraphrasing-hate-speech DPhate-double-paraphrasing-hate-speech Public

    Bachelor's thesis on removing hate from online comments using paraphrasing: algorithm DPhate

    Python

  2. scaling-monosemanticity-llama scaling-monosemanticity-llama Public

    Reproducing Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet using LLaMA. This project explores monosemantic neurons in large language models, implementing and extend…

    Jupyter Notebook 2

  3. Herz-bot Herz-bot Public

    A qlearning model for the card game called Herz.

    Java

  4. unbalanced-media unbalanced-media Public

    Analysis of Unbalanced Slovenian Media News Outlets - Left vs. Right Wing

    Python

  5. weather-prediction-mlops weather-prediction-mlops Public

    ML in the cloud project for the universtiy course Cloud Computin (RSO)

    Jupyter Notebook

  6. nyc-violation-tickets-analysis nyc-violation-tickets-analysis Public

    Analysis and prediction of NYC violation tickets using big data and machine learning techniques.

    Jupyter Notebook