在Hadoop中为了方便集群中各个组件之间的通信，它采用了RPC，当然为了提高组件之间的通信效率以及考虑到组件自身的负载等情况，Hadoop在其内部实现了一个基于IPC模型的RPC。关于这个RPC组件的整体情况我已经介绍过了，相关阅读：http://www.linuxidc.com/search.aspx?Where=Nkey&Keyword=Hadoop 。而在本文，我将结合源代码详细地介绍它在客户端的实现。

先来看看与RPC客户端相关联的一些类吧！

1.Client类

[java]

private Hashtable<ConnectionId, Connection> connections = new Hashtable<ConnectionId, Connection>(); //与远程服务器连接的缓存池
private Class<? extends Writable> valueClass; //远程方法调用发回后返回值解析器
final private int maxIdleTime; //连接的最大空闲时间
final private int maxRetries; //Socket连接时，最大Retry次数
private boolean tcpNoDelay; //设置TCP连接是否延迟
private int pingInterval; //ping服务端的间隔

在Client中，因为与服务器的每一次连接不仅会产生网络延迟，而且也会占用大量的系统资源，所以在Client内部设计了一个连接池，用来缓存与不同服务端的连接，通过对象ConnectionId来表示每一连接。当然，每一个连接有一个最大空闲期maxIdleTime，如果一个连接在时间maxIdleTime内没有被使用的话，该连接将自动关闭与Server的连接，以此来释放该连接在服务器端和客户端的系统资源。这个最大空闲期maxIdleTime的值可以通过客户端的配置文件来设置，对应的配置项为：ipc.client.connection.maxidletime。同时为了维护该连接的有效性，该连接设置了基于TCP的Socket的网络超时时间，当该连接发生SocketTimeoutException时，会自动的向服务器端发送ping包，来测试当前客户端与服务器端的连接是否正常，这个超时时间的值为pingInterval，该值的默认大小是60000ms，不过也可以通过客户端的配置文件来配置，对应的配置项为：ipc.ping.interva l。另外，当该连接向服务器发起连接请求失败的时候，可以不断的重新尝试，尝试的次数由maxRetries决定，当尝试的次数超过该值时，就将该连接视为彻底的失败，客户端的这一次RPC也就失败了。maxRetries默认的值为10，但也可以客户端的配置文件来配置，对应的配置选项为：ipc.client.connect.max.retries。底层的基于TCP的Socket网络连接还可以通过配置文件来设置是否延迟，对应的配置项为：ipc.client.tcpnodelay。其实，对于上面Client内部的四个参数，我们可以根据具体的应用场景来设置适当的值，已达到提高Hadoop集群性能的目的。当一个RPC成果返回之后，Client还需要把此次调用的返回结果解析成用户真正需要的数据类型(毕竟，网络返回的都是0/1序列)，所以Cleint在其内部还需要一个解析器，该解析器的类型为valuesClass，在hadoop的0.20.2.0版本中，这个解析器的类型为ObjectWritable。但是令人不解的是，每一次RPC调用返回之后，都会利用JDK的反射机制来创建该类的一个实例，然后利用这个解析器实例来解析返回结果，这样做的后果想必熟悉JDK反射机制的人都很清除了。

相关问答

在客户端和服务器端编程语言相同时，IDL在RPC中的作用？(Role of IDL in RPC when programming languages the same on client and server side?)[2023-12-04]

取决于语言，它是否有任何内置设施来编组参数和方法和对象标识符。 C / C ++没有内置的这种支持，所以我们有例如用于定义COM接口的MIDL。编译它将创建用于将方法调用语义转换为/从IPC / RPC消息传递的代理和存根代码。编译高级语言可能会产生足以在运行时生成封送处理的反射元数据，因此编程语言是 IDL。 Depends on the language, whether it has any built-in facility for marshaling arguments and method ...
JSON-RPC客户端在C＃中的示例代码(Sample code for JSON-RPC client in C#)[2023-10-12]

2个样品在这里有两种不同的实现。阅读整个线程+检查附件 2 Samples here There are two different implementations. Read the whole thread + check the attachments
如何在Windows上实现RPC客户端(How to implement RPC client on Windows)[2023-09-22]

Windows有它自己的RPC版本。它仅在Windows盒子之间正常工作。为了互操作性，您需要查看CORBA或WCF。 Windows has its own version of RPC. It works fine between Windows boxes only. For interoperability you need to look at CORBA or WCF.
获取RPC客户端的进程名称(Get Process Name for RPC Client)[2023-08-29]

要获取RPC客户端的进程名称，必须使用RpcServerInqCallAttributes查询进程ID，使用进程ID获取进程句柄的OpenProcess ，以及使用进程句柄获取完整进程名称的QueryFullProcessImageName 。 To get the process name for an RPC client, you must use RpcServerInqCallAttributes to query the process ID, OpenProcess with the proc ...
多客户端RPC(Multi client RPC)[2021-11-30]

如果我理解正确，您希望多个客户端注册相同的过程，然后在其中调用特定的过程。 WAMP使用相同的URI进行此过程是不可能的。对于您想要做的事情，预期的方法是使用包含客户端ID的URI，例如，如果您的过程是“com.example.calculate_load”，则客户端将注册“com.example.client_1.calculate_load”（或“com。 example.calculate_load.client_1“），您将通过过程URI寻址客户端。多个客户端可以在相同的URI下注册相同的过程， ...
Go中来自客户端和服务器的RPC(RPC from both client and server in Go)[2022-12-27]

我目前正在使用thrift（ thrift4go ）来实现server-> client和client-> server RPC功能。默认情况下，thrift只执行客户端 - >服务器调用，就像net / rpc一样。由于我还需要服务器 - >客户端通信，我做了一些研究并找到了比迪烟节。 Bidi-thrift解释了如何连接java服务器+ java客户端以进行双向thrift通信。什么比迪烟节俭，它的局限性。 TCP连接具有接收和输出通信线路（RC和TX）。 bidi-thrift的想法是拆分RS ...
使用RPC从客户端向其他客户端发送消息(Using RPC to send a message from a client to other clients)[2021-11-28]

按照John Bollinger最后的评论：“ （...）服务器只能通过客户端对RPC调用的响应来中继消息。”（......） “ 所以基本上不会，客户不能直接与其他客户进行沟通。他们可以通过调用服务器来发送和询问信息，并且通过这些请求可以从一个客户端“通信”到另一个客户端。 As per John Bollinger's very helpful last comment: "(...) the server can relay messages only via its responses to RP ...
Flink错误 - org.apache.hadoop.ipc.RemoteException：服务器IPC版本9无法与客户端版本4通信(Flink error - org.apache.hadoop.ipc.RemoteException: Server IPC version 9 cannot communicate with client version 4)[2023-06-14]

你有没有尝试过Flink的Hadoop-2版本？看看下载页面。有一个名为flink-0.9.0-milestone-1-bin-hadoop2.tgz应该可以与Hadoop 2一起使用。 Have you tried the Hadoop-2 build of Flink? Have a look at the downloads page. There is a build called flink-0.9.0-milestone-1-bin-hadoop2.tgz that should work ...
在hadoop webhdfs客户端中附加操作(append operation in hadoop webhdfs client)[2023-11-17]

假设您的用户名是hdfs ，请将&user.name=hdfs添加到您的URL。写操作需要有效用户。您的Java代码有效，因为它从unix环境中提取您的用户信息。如果您在任何地方看到用户dr.who ，可能是因为您没有在请求中设置user.name 。 Assuming your user name is hdfs, add &user.name=hdfs to your URL. Write operations require a valid user. Your java code works ...
如何在thrift python客户端中设置rpc超时？(How to set rpc timeout in thrift python client?)[2024-03-17]

您可以使用socket.setTimeout()方法。 from thrift.transport.THttpClient import THttpClient socket = THttpClient(server_url) socket.setTimeout(SERVICE_TIMEOUT_IN_mS) transport = TTransport.TBufferedTransport(socket) protocol = TBinaryProtocol.TBinaryProtocol(transpor ...

知识点

相关文章

最近更新

Hadoop中的RPC实现——客户端通信组件

相关问答

在客户端和服务器端编程语言相同时，IDL在RPC中的作用？(Role of IDL in RPC when programming languages the same on client and server side?)[2023-12-04]

JSON-RPC客户端在C＃中的示例代码(Sample code for JSON-RPC client in C#)[2023-10-12]

如何在Windows上实现RPC客户端(How to implement RPC client on Windows)[2023-09-22]

获取RPC客户端的进程名称(Get Process Name for RPC Client)[2023-08-29]

多客户端RPC(Multi client RPC)[2021-11-30]

Go中来自客户端和服务器的RPC(RPC from both client and server in Go)[2022-12-27]

使用RPC从客户端向其他客户端发送消息(Using RPC to send a message from a client to other clients)[2021-11-28]

Flink错误 - org.apache.hadoop.ipc.RemoteException：服务器IPC版本9无法与客户端版本4通信(Flink error - org.apache.hadoop.ipc.RemoteException: Server IPC version 9 cannot communicate with client version 4)[2023-06-14]

在hadoop webhdfs客户端中附加操作(append operation in hadoop webhdfs client)[2023-11-17]

如何在thrift python客户端中设置rpc超时？(How to set rpc timeout in thrift python client?)[2024-03-17]