What kinds of datasets exist that utilize code scanning tool outputs? Any ML solutions in the literature?