ROCm pytorch

Used this tutorial to install pytorch for rocm, however I checked out release 1.5. https://github.com/ROCmSoftwarePlatform/pytorch/wiki/Building-PyTorch-for-ROCm Allennlp was version 0.9.

GRU

BERT

This used bert-base with a batch size of 8.

Vega FE notes

The vega frontier edition results were obtained from a rented gpueater instance.

A batch size of 16 was also tried for the vega frontier edition to see if it would fit in vram and strangely the time per epoch dropped (01:12) with the larger batch size). This was also with thermal throttling as the vega fe was hitting 87 C and the clocks were down to 1.2 Ghz from 1.6 Ghz. The fans were limited to 40% under load on gpueater.com. It would be interesting to see what the performance is like with better thermals.

GPU	BERT-base emotion regression	GRU pos-tagger (1-hid)	GRU pos-tagger (2-hid)
GTX 1070	1:26.96	0:04.2	0:04.3
Tesla M40	1:32.76	0:04.05	0:04.3
RTX 3090	0:26.2	0:02.0	0:02.6
RX580	2:14.4	0:06.9	0:08.5
Vega Frontier	1:29.3	0:04.4	0:05.1
Vega Frontier (90% fans)	1:09.1	0:02.3	0:03.0
Vega frontier (rocm 4.0)	1:07.5	0:02.4	0:02.9
i7-7800x	x	00:18	00:23
i9-7900x (defective?)	x	00:19	00:23
i9-7900x	x	00:16	00:20
i9-7980xe	x	00:15	00:18
e5-2680v3	x	00:27	00:34

using rocm apex gave no discernable performance improvement (with use_apex = true) However, it did reduce memory consumption by ~1GB for a batch of 16.

The RTX 3090 was tested with cuda 11, all other nvidia gpus were using cuda 10.2 (the RTX 3090 is not supported in this earlier version of cuda).

Redirect traffic using iptables

Make lxd container accessible outside of host machine

How to control turbo boost inside the operating system

lxc container setup

Run docker container inside of lxc container

Lock packages in zypper

Installing matrix-synapse on pypy

Download from nextcloud using wget

Install printer drivers for brother printer in opensuse tumbleweed

Install numpy-blis for amd efficiency

Allennlp

Install GTKWattman on opensuse tumbleweed

How to use Intel's mkl library on AMD systems

Install pytorch-rocm on bare metal opensuse Tumbleweed

Install aftermarket cooler on M40

Use Intel Quad bypass cards (82571EB PRO)

Build pytorch with rocm on ubuntu 20.04

Enable hibernation on asus zenbook with ryzen processor

Set up zyxel travel router in client bridged mode

Get my EGPU to work on opensuse tumbleweed with a framework 12th gen laptop

NAS

Server refresh

SLURM

July 2019

August 2019

September 2019

October 2019

NOvember 2019

December

04 April

03 March 2019

May 2019

Laptop ram upgrade

nvidia apex

CFG.jl

ROCm pytorch

Linpack results

4 nodes

5 nodes

Comparison benchmarks

Moosefs comparison

workbench container

January 2020

February 2020

March 2020

April 2020

May 2020

September 2020

September 2018

November 2018

October 2018

Speakers

January

LBA write rate on database host

March

April

December

Cluster Buster plan

Desktop shutting down on its own

ROCm pytorch

GRU

BERT

Vega FE notes

No Comments