<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Transformer | Tuhin Sharma</title><link>https://tuhinsharma.netlify.app/tags/transformer/</link><atom:link href="https://tuhinsharma.netlify.app/tags/transformer/index.xml" rel="self" type="application/rss+xml"/><description>Transformer</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Wed, 06 Nov 2024 14:15:00 +0000</lastBuildDate><image><url>https://tuhinsharma.netlify.app/media/icon_hu55b84836e614877e119cbfa37f6d5a66_1386708_512x512_fill_lanczos_center_3.png</url><title>Transformer</title><link>https://tuhinsharma.netlify.app/tags/transformer/</link></image><item><title>Decoding the Power of Multi-Head Attention in Transformers</title><link>https://tuhinsharma.netlify.app/blogs/multihead-attention/</link><pubDate>Wed, 06 Nov 2024 14:15:00 +0000</pubDate><guid>https://tuhinsharma.netlify.app/blogs/multihead-attention/</guid><description>&lt;h2>Description&lt;/h2>
&lt;p>In our pursuit of creating a fun dialogue completer using transformers, we chose to build a system that could complete famous quotes — like those from Cersei Lannister when she played the game of thrones. To achieve this, we first passed our input text through an embedding layer, adding position information to each word embedding to create position-aware embeddings. These were then sent forward to the next layer. In this segment, we dive deeper into the transformative heart of transformer models: the multi-head attention mechanism, which, I must warn, is a powerful multi-faceted component.&lt;/p>
&lt;p>&lt;a href="https://medium.com/@tuhinsharma121/riding-multi-headed-dragons-decoding-the-power-of-multi-head-attention-in-transformers-7c9d18dc2b68">&lt;p>&lt;strong>Read more..&lt;/strong>&lt;/p>&lt;/a>&lt;/p></description></item></channel></rss>