Equilibrated adaptive learning rates for non-convex optimization
Overview Train deep learning models faster with a simple tweak: ESGD. Many networks get stuck on flat stretches or saddle points that slow learning down, and p...
What is a GAN? GAN stands for Generative Adversarial Network. It was introduced in 2014 by Ian Goodfellow. A GAN consists of two neural networks that compete w...
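The two-network game described above can be made concrete with a toy one-dimensional sketch. Everything here (linear generator, logistic discriminator, the hyperparameters, the target distribution N(3, 0.5)) is my own illustrative choice, not from the article; real GANs use deep networks on both sides:

```python
import numpy as np

# Toy 1-D GAN sketch: generator G(z) = wg*z + bg tries to imitate data
# drawn from N(3, 0.5); discriminator D(x) = sigmoid(wd*x + bd) tries to
# tell real samples from generated ones. All names are illustrative.
rng = np.random.default_rng(0)

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

wd, bd = 0.1, 0.0   # discriminator parameters
wg, bg = 1.0, 0.0   # generator parameters
lr = 0.05

for step in range(2000):
    real = rng.normal(3.0, 0.5, size=32)   # "real" data batch
    z = rng.normal(0.0, 1.0, size=32)      # latent noise batch
    fake = wg * z + bg

    # Discriminator: gradient ascent on log D(real) + log(1 - D(fake))
    d_real = sigmoid(wd * real + bd)
    d_fake = sigmoid(wd * fake + bd)
    wd += lr * (np.mean((1 - d_real) * real) - np.mean(d_fake * fake))
    bd += lr * (np.mean(1 - d_real) - np.mean(d_fake))

    # Generator: gradient ascent on log D(fake) (non-saturating loss)
    d_fake = sigmoid(wd * (wg * z + bg) + bd)
    g_grad = (1 - d_fake) * wd             # d log D(fake) / d fake
    wg += lr * np.mean(g_grad * z)
    bg += lr * np.mean(g_grad)

# After training, the generator's output mean (approximately bg, since z
# has zero mean) has drifted toward the real mean of 3.
```

The alternating updates are the whole adversarial idea: each player takes a gradient step against the other's current parameters.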
The 15‑Year‑Old Code That Still Runs in Production Haar Cascades are everywhere. If you've ever used OpenCV's face detector, you've used a method published in...
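A key reason the method has aged so well is the integral image, which turns any rectangle sum into four table lookups, so each Haar feature costs constant time regardless of its size. A minimal pure-Python sketch (function names are my own):

```python
# Integral image: ii[y][x] holds the sum of all pixels above and to the
# left of (x, y). With it, any rectangle sum needs only four lookups.
def integral_image(img):
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        for x in range(w):
            ii[y + 1][x + 1] = img[y][x] + ii[y][x + 1] + ii[y + 1][x] - ii[y][x]
    return ii

def rect_sum(ii, x, y, w, h):
    # Sum of the w-by-h rectangle with top-left corner (x, y), in O(1).
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]

# A two-rectangle Haar "edge" feature: left half minus right half.
def haar_edge(ii, x, y, w, h):
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)
```

The cascade then evaluates thousands of such features, but each one is just a handful of additions, which is why the detector runs in real time on very modest hardware.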
Physics-Informed Neural Networks present a novel approach in SciML that integrates physical laws in the form of partial differential equations directly into the...
Max Pooling In the previous article (https://dev.to/rijultp/image-classification-with-convolutional-neural-networks-part-2-creating-a-feature-map-gd0) we created...
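The pooling step itself is tiny: slide a window over the feature map and keep only the maximum in each position. A pure-Python sketch, assuming the common 2×2 window with stride 2 (names are illustrative):

```python
def max_pool(feature_map, size=2, stride=2):
    """Slide a size-by-size window with the given stride, keeping the max."""
    h, w = len(feature_map), len(feature_map[0])
    pooled = []
    for y in range(0, h - size + 1, stride):
        row = []
        for x in range(0, w - size + 1, stride):
            row.append(max(feature_map[y + dy][x + dx]
                           for dy in range(size) for dx in range(size)))
        pooled.append(row)
    return pooled

fmap = [[1, 3, 2, 4],
        [5, 6, 7, 8],
        [9, 2, 1, 0],
        [3, 4, 5, 6]]
print(max_pool(fmap))  # [[6, 8], [9, 6]]
```

Each 2×2 block collapses to its largest value, halving both dimensions while preserving the strongest activations.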
We propose an optimization-informed deep neural network approach, named iUzawa-Net, aiming to be the first solver that enables real-time solutions for a class of ...
Current unified multimodal models for image generation and editing typically rely on massive parameter scales (e.g., >10B), entailing prohibitive training co...
Vulnerability detection is crucial to protect software security. Nowadays, deep learning (DL) is the most promising technique to automate this detection task, l...
- Part 1: Understanding the Host and Device Paradigm — this article - Part 2: Point‑to‑Point and Collective Operations — coming soon - Part 3: How GPUs...
Not All RecSys Gigs Are Created Equal The industry’s outliers have distorted our definition of recommender systems. TikTok, Spotify, and Netflix employ hybrid...
Why We Need CNNs In this article, we will explore image classification using convolutional neural networks. For this, we will use a simple example: classifying an X or an O....
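The operation at the heart of a CNN is sliding a small kernel over the image and summing elementwise products to build a feature map. A minimal pure-Python sketch (valid cross-correlation, no padding; names are illustrative):

```python
def convolve(image, kernel):
    """Valid cross-correlation: slide the kernel over the image and
    sum the elementwise products at each position."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[y + i][x + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for x in range(out_w)]
            for y in range(out_h)]

# A diagonal kernel responds strongly along the diagonal strokes of an X.
diag = [[1, 0],
        [0, 1]]
img = [[1, 0, 0],
       [0, 1, 0],
       [0, 0, 1]]
print(convolve(img, diag))  # [[2, 0], [0, 2]]
```

The peaks in the output mark where the image locally matches the kernel, which is exactly how a CNN tells the diagonal strokes of an X from the curve of an O.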
Multi-hop all-reduce is the de facto backbone of large model training. As the training scale increases, the network often becomes a bottleneck, motivating reduc...
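The single-hop building block here is classic ring all-reduce: a reduce-scatter pass followed by an all-gather, each taking N-1 steps around the ring, so every worker sends only about 2·(N-1)/N of its data in total. A small simulation sketch under those standard assumptions (pure Python, my own naming; real systems would overlap these sends over NCCL or similar):

```python
def ring_allreduce(vectors):
    """Simulated ring all-reduce: each of n workers starts with one vector;
    afterwards every worker holds the elementwise sum of all vectors."""
    n = len(vectors)
    length = len(vectors[0])
    data = [list(v) for v in vectors]                 # per-worker buffers
    starts = [c * length // n for c in range(n + 1)]  # chunk boundaries

    # Phase 1: reduce-scatter. At each step, worker r sends one chunk to
    # its ring neighbour, which adds it into its own buffer. After n-1
    # steps, worker r holds the fully reduced chunk (r + 1) % n.
    for step in range(n - 1):
        for r in range(n):
            c = (r - step) % n        # chunk worker r sends this step
            dst = (r + 1) % n         # ring neighbour
            for i in range(starts[c], starts[c + 1]):
                data[dst][i] += data[r][i]

    # Phase 2: all-gather. The completed chunks circulate around the
    # ring, overwriting stale data, until every worker has all of them.
    for step in range(n - 1):
        for r in range(n):
            c = (r + 1 - step) % n    # completed chunk to forward
            dst = (r + 1) % n
            for i in range(starts[c], starts[c + 1]):
                data[dst][i] = data[r][i]
    return data
```

Because every step moves only 1/N of the vector per worker, bandwidth use is near-optimal, which is why the ring pattern (and its multi-hop descendants) dominates large-scale training.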