Mandatory warning: Whatever you're measuring with sentiment detection, it's probably not sentiment, and it's extremely inconsistent across implementations and humans.<p>This wonderful paper shows how unreliable sentiment detection is: <a href="https://www.tandfonline.com/doi/pdf/10.1080/19312458.2020.1869198" rel="nofollow">https://www.tandfonline.com/doi/pdf/10.1080/19312458.2020.18...</a><p>Summary: (1) The best performance is still attained with trained human or crowd coding; (2) None of the used dictionaries come close to acceptable levels of validity; and (3) machine learning, especially deep learning, substantially outperforms dictionary-based methods but falls short of human performance