arxiv On Speaker Attribution with SURT