March 2020

March 4 2020

Trying to get a Tesla M40 into the Z620

UEFI is required! I converted the Z620 machine (mwanafunzi) from legacy boot to UEFI boot and switched the GPU from legacy to EFI support. Here is the pastebin output from that.
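A quick way to confirm the conversion took effect is to check whether the kernel exposes the EFI directory in sysfs. This is a minimal sketch, assuming a Linux host; the function name `booted_in_uefi` is my own and not from the original notes.

```python
from pathlib import Path


def booted_in_uefi(efi_dir: str = "/sys/firmware/efi") -> bool:
    """Return True if the given EFI sysfs directory exists.

    On Linux, /sys/firmware/efi is only present when the running
    kernel was started by UEFI firmware rather than legacy BIOS.
    """
    return Path(efi_dir).is_dir()


if __name__ == "__main__":
    print("UEFI boot" if booted_in_uefi() else "legacy (BIOS) boot")
```

The path is a parameter only so the check can be exercised against a scratch directory; on a real machine the default is the one to use.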

This worked very well: with the Tesla drivers installed, the M40 showed up with all 24 GB of VRAM.
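One way to check that the card and its full memory are visible is to ask `nvidia-smi` for the GPU name and total memory. The sketch below assumes the NVIDIA driver is installed and `nvidia-smi` is on the PATH; the helper names `parse_gpu_line` and `list_gpus` are hypothetical.

```python
import subprocess

# --query-gpu and --format are standard nvidia-smi options; with
# "nounits" the memory.total column is a bare number in MiB.
QUERY = [
    "nvidia-smi",
    "--query-gpu=name,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_line(line: str) -> tuple[str, int]:
    """Split one CSV line into (gpu_name, total_memory_mib)."""
    # Split from the right so a comma inside the name cannot break parsing.
    name, mem_mib = (field.strip() for field in line.rsplit(",", 1))
    return name, int(mem_mib)


def list_gpus() -> list[tuple[str, int]]:
    """Run nvidia-smi and return one (name, MiB) tuple per visible GPU."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return [parse_gpu_line(l) for l in result.stdout.splitlines() if l.strip()]
```

Calling `list_gpus()` on the host should return one entry per card, and dividing the second field by 1024 gives the size in GiB.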

I tested this with AllenNLP and it worked quite well. The M40 trained an RNN about as fast as my 1070s. This is the worst case for the M40 in this comparison, because the 1070 has a higher clock rate but fewer CUDA cores, and RNNs are difficult to parallelize.

The additional VRAM allowed me to go up to a batch size of 128.

However, my cooling solution for the M40 was insufficient. After about one epoch of training (5 minutes of heavy usage) the temperature exceeded 80 °C and I had to end the workload. I was using a single Noctua NF-A4x20 fan, which only provides about 5 CFM of airflow. I purchased a 2-pack of Delta 40 mm fans that achieve 10 CFM each, which should be enough airflow for operation. The noise level goes up from 17 dBA to 35 dBA (per the documentation for the respective products), but since the Z620 case fans are about 35 dBA, the difference shouldn't be very noticeable.
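As a rough check of the fan numbers above, the sketch below adds up the airflow and combines the noise levels. It assumes both Delta fans run side by side at their rated free-air figures, and that the fans are independent (incoherent) noise sources, so their sound powers add. Real airflow through the M40 heatsink will be lower than the free-air rating, so treat this as an estimate only.

```python
import math


def combined_dba(levels_dba):
    """Combine the sound levels of independent sources, in dBA.

    Decibels are logarithmic, so the levels are converted to relative
    power, summed, and converted back.
    """
    total_power = sum(10 ** (level / 10) for level in levels_dba)
    return 10 * math.log10(total_power)


def total_cfm(fan_cfm, count):
    """Airflow of identical fans in parallel (free-air ratings, assumed)."""
    return fan_cfm * count


if __name__ == "__main__":
    old_cfm = total_cfm(5, 1)    # single Noctua fan
    new_cfm = total_cfm(10, 2)   # assumed: both Delta fans installed
    print(f"airflow: {old_cfm} CFM -> {new_cfm} CFM ({new_cfm / old_cfm:.0f}x)")
    print(f"two 35 dBA fans together: {combined_dba([35, 35]):.1f} dBA")
```

Two equal sources add about 3 dB, so a pair of 35 dBA fans comes out near 38 dBA rather than 35 dBA. That is still in the same range as the case fans.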
