deep learning

2 days ago · ai

Understanding LSTMs – Part 7: LSTM in Action with Real Data

Introduction: In the previous article, we completed all three stages of the LSTM: the Forget Gate, Input Gate, and Output Gate. Now, let us use the LSTM with re...

#LSTM #time series #stock prediction #deep learning #neural networks #sequence modeling
4 days ago · ai

Understanding LSTMs – Part 6: How LSTM Produces Its Final Output

In the previous article, we went through the input gate; in this article, we will explore the next component. Final Stage: Updating the Short-Term Memory. This fin...

#LSTM #recurrent neural networks #deep learning #output gate #tanh activation #sigmoid activation #machine learning
5 days ago · ai

Building a Minimal Transformer for 10-digit Addition

Article URL: https://alexlitzenberger.com/blog/post.html?post=/building_a_minimal_transformer_for_10_digit_addition Comments URL: https://news.ycombinator.com/i...

#transformer #minimal model #addition #neural networks #deep learning #machine learning #arithmetic AI
1 week ago · ai

Understanding LSTMs – Part 5: The Input Gate Explained

The Input Gate Explained. In the previous article, we went through the second and third components of an LSTM. We will deepen that understanding here. Starting...

#LSTM #input gate #tanh #neural networks #deep learning #recurrent neural network
1 week ago · ai

Understanding LSTMs – Part 4: How LSTM Decides What to Forget

In the previous article, we completed the first part of the LSTM and obtained the result from the calculation. Let us continue. Forget Gate: When the input was 1...

#LSTM #forget gate #recurrent neural networks #deep learning #sigmoid activation #machine learning
2 weeks ago · ai

VoxCPM: A Novel Tokenizer-Free Approach to Context-Aware Speech Generation and Voice Cloning

Exploring VoxCPM: A Tokenizer-Free Approach to Advanced Speech Synthesis and Voice Cloning. In the rapidly evolving field of AI, breakthroughs in speech technol...

#speech synthesis #text-to-speech #voice cloning #tokenizer-free #deep learning #neural networks #context-aware generation #VoxCPM
2 weeks ago · ai

AI in Multiple GPUs: How GPUs Communicate

Series: Distributed AI Across Multiple GPUs - Part 1: Understanding the Host and Device Paradigm https://towardsdatascience.com/understanding-the-host-and-devic...

#multi‑GPU #distributed training #GPU communication #deep learning #gradient synchronization #parallelism #CUDA #NCCL
2 weeks ago · ai

Fei-Fei Li's World Labs raised $1B from A16Z, Nvidia to advance its world models

Article URL: https://www.bloomberg.com/news/articles/2026-02-18/ai-pioneer-fei-fei-li-s-startup-world-labs-raises-1-billion Comments URL: https://news.ycombinat...

#Fei-Fei Li #World Labs #AI funding #a16z #Nvidia #world models #deep learning #generative AI #AI startup
2 weeks ago · it

SpaceX vets raise $50M Series A for data center links

Travis Brashears, Cameron Ramos and Serena Grown‑Haeberli began collaborating at SpaceX, developing optical communications links that keep thousands of Starlink...

#SpaceX #Mesh Optical Technologies #optical transceivers #data center hardware #deep learning #GPU interconnect #Series A funding #Thrive Capital
2 weeks ago · ai

Understanding AI from First Principles: Multi-Layer Perceptrons and the Hidden Layer Breakthrough

“The perceptron has many limitations… the most serious is its inability to learn even the simplest nonlinear functions.” – Marvin Minsky. The Problem That Stump...

#perceptron #XOR problem #neural networks #hidden layer #deep learning #machine learning fundamentals
2 weeks ago · ai

Pruning in Deep Learning: Structured vs Unstructured

Introduction: Deep learning models are becoming larger and more powerful every year. From mobile vision systems to large language models, the number of paramete...

#model pruning #deep learning #structured pruning #unstructured pruning #model compression #edge AI #inference optimization #neural network efficiency
2 weeks ago · ai

[Paper] AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories

Maintaining spatial world consistency over long horizons remains a central challenge for camera-controllable video generation. Existing memory-based approaches ...

#video generation #spatial memory #computer vision #deep learning #transformer
