JD DMT (Deep Multifaceted Transformers)

Paper: https://dl.acm.org/doi/pdf/10.1145/3340531.3412697
The paper proposes modeling multiple types of user behavior sequences with separate Transformers, stacking MMoE on top for the multi-objective targets, and finally adding a bias tower to debias the data.

Using click/non-click as feedback typically suffers from position bias and neighboring bias, although the paper's treatment of debiasing is fairly simple.

The overall structure of DMT is shown below.
[Figure: DMT architecture]
The input is split into two kinds of features: categorical features and dense features.
Categorical features:

1. The user's behavior sequences: $S = \langle s_1, s_2, \dots, s_T \rangle$, where $s_i = (t_i, p_i)$ means the user interacted with item $p_i$ at time $t_i$. The paper uses the click sequence $S_c$, the add-to-cart sequence $S_a$, and the purchase (order) sequence $S_o$.
2. Embedding Layer: for each item, the item id $p_i$, category id $c_i$, brand id $b_i$, and shop id $s_i$ are each mapped to a low-dimensional vector $e_{p_i}, e_{c_i}, e_{b_i}, e_{s_i}$, and the four vectors are concatenated into the item vector $e_i$ (a minimal sketch follows this list).
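A minimal PyTorch sketch of such an embedding layer; the vocabulary sizes and embedding dimension are illustrative assumptions, not values from the paper.

```python
import torch
import torch.nn as nn

class ItemEmbedding(nn.Module):
    """Maps (item id, category id, brand id, shop id) to a concatenated vector e_i."""
    def __init__(self, n_items=10000, n_cates=500, n_brands=800, n_shops=300, dim=16):
        super().__init__()
        self.item = nn.Embedding(n_items, dim)
        self.cate = nn.Embedding(n_cates, dim)
        self.brand = nn.Embedding(n_brands, dim)
        self.shop = nn.Embedding(n_shops, dim)

    def forward(self, p, c, b, s):
        # p, c, b, s: LongTensor ids of shape (batch, seq_len)
        # returns e_i = concat(e_p, e_c, e_b, e_s), shape (batch, seq_len, 4*dim)
        return torch.cat([self.item(p), self.cate(c),
                          self.brand(b), self.shop(s)], dim=-1)
```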

Dense features (each normalized):

1. item profile features (e.g., number of clicks, CTR, CVR, rating)
2. user profile features (e.g., purchase power, preferred categories and brands)
3. user-item matching features (e.g., whether the item matches the user's gender or age)
4. user-item interaction features (e.g., number of clicks on the category of the item within a time window)

Deep Multifaceted Transformers Layer

Three separate Transformers model the click, add-to-cart, and purchase behavior sequences. In the encoder, the sequence's item embeddings are the input to self-attention; in the decoder, the target item's embedding serves as the query, and the encoder output serves as the keys and values.
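Below is a minimal PyTorch sketch of one such behavior-sequence Transformer; positional/temporal encodings are omitted for brevity, and layer counts and sizes are assumptions.

```python
import torch
import torch.nn as nn

class SeqTransformer(nn.Module):
    def __init__(self, d_model=64, n_heads=4, n_layers=1):
        super().__init__()
        enc_layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=n_layers)
        self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, seq_emb, target_emb):
        # seq_emb: (batch, T, d_model) embeddings e_1..e_T of one behavior sequence
        # target_emb: (batch, d_model) embedding of the candidate (target) item
        memory = self.encoder(seq_emb)                   # encoder: self-attention over the sequence
        query = target_emb.unsqueeze(1)                  # decoder: target item embedding as query
        out, _ = self.cross_attn(query, memory, memory)  # key/value = encoder output
        return out.squeeze(1)                            # (batch, d_model) sequence representation
```

In DMT, three such towers (click, cart, order) run in parallel, and their outputs feed the MMoE layer together with the other features.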

Multi-gate Mixture-of-Experts Layers

The expert networks output $e_1(x), e_2(x), \dots, e_N(x)$. Each task $k$ has a gating network $NN_G^k(x)$ that learns per-expert weights $w^k$; the weighted sum of the expert outputs is then fed into a task-specific tower network to produce that task's output of the MMoE layer:
$$w^k = \mathrm{softmax}(NN_G^k(x))$$
$$f^k(x) = \sum_{i=1}^N w_i^k\, e_i(x)$$
$$u_k = NN_U^k(f^k(x))$$
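A minimal PyTorch sketch of these equations; the expert/task counts and layer sizes are assumptions.

```python
import torch
import torch.nn as nn

class MMoE(nn.Module):
    def __init__(self, in_dim=128, n_experts=4, n_tasks=2, hidden=64):
        super().__init__()
        # experts e_1(x)..e_N(x); one gate NN_G^k and one tower NN_U^k per task
        self.experts = nn.ModuleList(
            [nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU()) for _ in range(n_experts)])
        self.gates = nn.ModuleList(
            [nn.Linear(in_dim, n_experts) for _ in range(n_tasks)])
        self.towers = nn.ModuleList(
            [nn.Linear(hidden, 1) for _ in range(n_tasks)])

    def forward(self, x):
        e = torch.stack([expert(x) for expert in self.experts], dim=1)  # (batch, N, hidden)
        u = []
        for gate, tower in zip(self.gates, self.towers):
            w = torch.softmax(gate(x), dim=-1)            # w^k = softmax(NN_G^k(x))
            f = (w.unsqueeze(-1) * e).sum(dim=1)          # f^k(x) = sum_i w_i^k e_i(x)
            u.append(tower(f).squeeze(-1))                # u_k = NN_U^k(f^k(x))
        return u                                          # list of K task logits
```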

Bias Deep Neural Network

A dedicated bias tower takes only bias-related features as input: for position bias, the display position index or page index; for neighboring bias, the category of the target item and the categories of its K neighboring items.
The bias tower's output is
$$y_b = NN_B(x_b)$$
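A minimal sketch of such a bias tower, using only an embedded position index as input; the neighboring-bias features (neighbor categories) would be embedded and concatenated in the same way. All sizes are assumptions.

```python
import torch
import torch.nn as nn

class BiasTower(nn.Module):
    def __init__(self, n_positions=200, pos_dim=8, hidden=32):
        super().__init__()
        self.pos_emb = nn.Embedding(n_positions, pos_dim)
        self.mlp = nn.Sequential(
            nn.Linear(pos_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1))

    def forward(self, position):
        # position: LongTensor (batch,) display position index of the impression
        return self.mlp(self.pos_emb(position)).squeeze(-1)  # y_b = NN_B(x_b), shape (batch,)
```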

Model Training and Prediction

Each task $k$'s output $y_k$ adds the bias tower's logit before the sigmoid. All tasks are binary classification tasks trained with cross-entropy loss; below, $y_i$ is the label of sample $i$ and $y_{k,i}$ is task $k$'s prediction for it:
$$y_k = \sigma(u_k + y_b)$$
$$L_k = -\frac{1}{N}\sum_{i=1}^N \left[\, y_i \log(y_{k,i}) + (1-y_i)\log(1-y_{k,i}) \,\right]$$
$$L = \sum_{k=1}^K \lambda_k L_k$$
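A minimal sketch of this objective; `dmt_loss` is a hypothetical helper, and the task weights $\lambda_k$ are assumed to be given.

```python
import torch

def dmt_loss(u_list, y_b, labels, lambdas):
    """Multi-task loss L = sum_k lambda_k * L_k, with y_k = sigma(u_k + y_b)."""
    total = 0.0
    for u_k, y_true, lam in zip(u_list, labels, lambdas):
        y_k = torch.sigmoid(u_k + y_b).clamp(1e-7, 1 - 1e-7)  # y_k = sigma(u_k + y_b)
        bce = -(y_true * torch.log(y_k)
                + (1 - y_true) * torch.log(1 - y_k)).mean()   # L_k (binary cross-entropy)
        total = total + lam * bce
    return total
```

In practice one would compute each term with `torch.nn.functional.binary_cross_entropy_with_logits(u_k + y_b, y_true)` for numerical stability; the explicit form above just mirrors the formula.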

The above is the training stage. At prediction time, task $k$ outputs $\hat y_k$ with the bias term dropped, and the final score is a weighted average of the tasks' predicted scores, with the weights found by offline parameter search:
$$\hat y_k = \sigma(u_k)$$
$$\hat y = \frac{\sum_{k=1}^K w_k\, \hat y_k}{\sum_{k=1}^K w_k}$$
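A minimal sketch of the serving-time fusion; the fusion weights here are illustrative placeholders for the values the paper obtains by offline search.

```python
import torch

def fuse_scores(u_list, weights):
    # u_list: K task logits u_k, each (batch,); weights: K floats from offline search
    y_hat = [torch.sigmoid(u_k) for u_k in u_list]  # y_hat_k = sigma(u_k), bias tower dropped
    return sum(w * y for w, y in zip(weights, y_hat)) / sum(weights)
```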

Experiments

[Figure: experimental results]
In essence, this paper combines Transformers, MMoE, and debiasing: a separate Transformer for each type of behavior sequence yields a good embedding of that sequence. That this combination works so well says a lot about how effective these basic building blocks are.
