Twitter reveals some of its source code, including its recommendation algorithm

Kyle Wiggers
TechCrunch
Fri, 31 Mar 2023 18:12 UTC

As repeatedly promised by Twitter CEO Elon Musk, Twitter has opened a portion of its source code to public inspection, including the algorithm it uses to recommend tweets in users' timelines.

On GitHub, Twitter published two repositories containing code for many parts that make the social network tick, including the mechanism Twitter uses to control the tweets users see on the For You timeline. In a blog post, Twitter characterized the move as a "first step to be[ing] more transparent" while at the same time "[preventing] risk" to Twitter itself and people on the platform.

On a Twitter Spaces session today, Musk clarified:

"Our initial release of the so-called algorithm is going to be quite embarrassing, and people are going to find a lot of mistakes, but we're going to fix them very quickly."

"Even if you don't agree with something, at least you'll know why it's there, and that you're not being secretly manipulated ... The analog, here, that we're aspiring to is the great example of Linux as an open source operating system ... One can, in theory, discover many exploits for Linux. In reality, what happens is the community identifies and fixes those exploits."

On that second point in the blog post about preventing risk, the open source releases don't include the code that powers Twitter's ad recommendations or the data used to train Twitter's recommendation algorithm. Moreover, they include few instructions on how to inspect or actually use the code — reinforcing the idea that the releases are strictly developer-focused.

"[We excluded] any code that would compromise user safety and privacy or the ability to protect our platform from bad actors, including undermining our efforts at combating child sexual exploitation and manipulation," Twitter wrote. It's a bit of mixed messaging coming only weeks after Twitter fired much of its ethical AI and trust and safety staff, which was responsible for content moderation among other user security-related tasks. But the company nonetheless insists that it "[took] steps to ensure that user safety and privacy would be protected" with today's code release.

Twitter says it's working on tools to manage code suggestions from the community and sync changes to its internal repository. Presumably, those will be made available at a future date — there's no sign of them at the present.

"We're going to look for suggestions, not just on bugs but also on how the algorithm should work," Musk said on the Spaces session. "It's going to be an evolving process. I wouldn't expect it to be a nonstop upward movement... but we're very open to what would improve the user experience."

At first glance, the algorithm is fairly complex — but not necessarily surprising in any way from a technical standpoint. It's made up of multiple models, including a model for detecting "not safe for work" or abusive content, determining the likelihood of a Twitter user interacting with another user and calculating a Twitter user's "reputation." (It's unclear what "reputation" refers to, exactly; the high-level documentation isn't clear on that.) Several neural networks are responsible for ranking the tweets and recommending accounts to follow, while a filtering component hides tweets to — forgive the jargon — "support legal compliance, improve product quality, increase user trust, protect revenue through the use of hard-filtering, visible product treatments and coarse-grained downranking."

In an engineering blog post, Twitter reveals more about the recommendation pipeline, which it claims runs approximately five billion times per day:

"We attempt to extract the best 1,500 tweets from a pool of hundreds of millions ... Today, the For You timeline consists of 50% [tweets from people you don't follow] and 50% [tweets from people you follow] on average, though this may vary from user to user," Twitter wrote. "Ranking [tweets] is achieved with a ~48-million-parameter neural network that is continuously trained on tweet interactions to optimize for positive engagement (e.g. likes, retweets and replies)."

Twitter users don't see the full 1,500 tweets, of course. They're filtered according to content restrictions and other criteria and factors considered by the models, like if tweets have "negative feedback" and if they're mainly from the same Twitter user, or from users who've been blocked or muted.

Gizmodo notes that one thing that doesn't appear to have been made public is the list of VIPs that Twitter pushes to users. This week, Platformer reported that Twitter has a rotating list of noteworthy users, including YouTuber Mr. Beast and Daily Wire founder Ben Shapiro, that it uses to monitor changes to the recommendation algorithm by increasing the visibility of these "power users" seemingly at will.

There's more evidence that the algorithm may treat tweets differently depending on the source. Researcher Jane Manchun Wong noted that Twitter's algorithm specifically labels whether the tweet author is Elon Musk and has others labels indicating whether the author is a "power user" as well as whether they're a Republican or Democrat.

During the Spaces session this afternoon, a Twitter engineer said that the labels were used only for metrics. But Musk — who said he wasn't aware of the labels prior to today — said that they shouldn't be there.

"It definitely shouldn't be dividing people into Republicans and Democrats, that makes no sense," Musk said.

The release of the source code comes after several controversies involving tweaks to Twitter's recommendation algorithm in recent months. According to Platformer, in February, Musk called on Twitter's engineers to reconfigure the algorithm so his tweets would be more widely viewed. (Twitter later walked back this change — at least somewhat.) In November, Twitter began showing users more tweets from people they don't follow — a move the platform attempted prior to Musk's acquisition but later reversed after a backlash from users.

Reader Comments

Joan · 2023-04-02T03:13:19Z

Only some?..Why not all, in the Musk, disclosure, of truth?

And his previous disclosures with Maxx Tiabii

danny esq · 2023-04-03T06:35:42Z

The base notes of perfumery makes another odor.

kksalm · 2023-04-03T08:10:32Z

As somebody that doesn’t do the twitter I’m all wtf? and whotf? cares. It’s all distraction and maybe it’s time you should pick up that book your friend or family member recommended. I’m reminded of some previous advice from another generation, kill your tv.

Come on people, twitter? Does it matter?

Have a wonderful day without it.

Bezel Bub · 2023-04-03T19:03:51Z

I first checked out Twitter when they went live, found it boring and useless to me, so i left. Out of curiosity I checked it out when he bought it and my impression is that it has not changed much, if at all. The weird thing now, though? It has devolved into a "Mush Fan Club"...he makes pretty boring comments, and then has a small army of fan bois fawning over him. Still weird...maybe Ill check back in another ten years...or not....

DonalDd · 2023-06-11T11:03:05Z

I agree with you. I don't like Twitter either. Due to the character limit, tweets often lack the necessary context to fully understand the intended meaning. Also Twitter's algorithm tends to show users content that aligns with their existing views and preferences, which can lead to the formation of echo chambers. And the app itself has a lot of problems. Twitter's API (Application Programming Interface) imposes certain limitations on developers, such as rate limits and restrictions on accessing certain features or data. If you're making a mobile app for Twitter that costs 44 billion dollars, you could try to do better by regularly testing and collecting feedback to release updates based on that [url=Mobile App Testing Services | Mobile App Testing company | QAwerk]link . Otherwise, it's not clear what you're doing.

I took this route, cause it was quicker... [Link]

JTF Truth

" Conquering tyranny will be bloody and require good men to do bad things " - Ive been to war, it is ugly. However, going back through history...

Spur2

I try to keep half an eye on solar activity, which is easy to do these days. Often if there is a lot of activity, with a little imagination its...

Ben

Oh my goodness...an absolute masterclass on why Trump is such a bitch ass motherfucker. How can maga tards still support this douche? He's a super...

zaetheric

Ancient peoples practiced the magic healing ritual of keeping warm by the fire. Surprisingly, people are STILL practicing this ancient ritualistic...

guard4her

Science & Technology

Twitter reveals some of its source code, including its recommendation algorithm

Reader Comments

Latest News

Picture of the Day

Quote of the Day

Recent Comments

Quantum Quirk