ACM
ADAMw
Adadelta
Ansari
Auer
AutoInt
Babenko
Bohlke
Bruss
CIKM
CLS
CMD
CUDA
Cham
Chronos
Codecov
Contrastive
Duan
Duchi
GPUs
GeGLU
Goldblum
Goldstein
Gorishniy
Hazan
HuggingFace
Hutter
Khrulkov
Küken
LBFGS
Lifecycle
Loshchilov
MLP
MLPs
MPS
Mobilenetv
ORCID
PBC
POSIXct
Pre
RLN
RLNs
ReLU
Ren
ResNet
ResNets
Rubachev
SGD
SHA
Sandler
Schwarzschild
Segal
Shavitt
Shchur
Shi
Somepalli
Springer
Xiao
Xu
Zeiler
Zhang
Zhmoginov
Zhu
activations
adadelta
adagrad
al
doi
dtype
embeddings
ensembling
et
extensibility
funder
https
interpretability
learnable
magrittr
mlp
multilayer
natively
overspecified
perceptrons
pre
preprint
pretrained
pretraining
relu
softmax
sparsification
subgradient
tanh
th
tibble
tidymodels
tidyselect
