RxT-Beta 3B A190M Supervised Demo

Demo for supervised version of first real-scale Reactive Transformer model with 3B total params and 190M active in decoder.

Work in progress - fixing generation errors

Limitations

Supervised version of the model is still in intermediate stage and will be further improved in Direct Memory and Preference Optimization (DMPO) stage (demo will be constantly updated).